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Abstract 



Hyperbolicity is a property of a graph that may be viewed as being a "soft" version of a tree, and 
recent empirical and theoretical work has suggested that many graphs arising in Internet and related data 
applications have hyperbolic properties. Here, we consider Gromov's notion of (5-hyperbolicity, and we 
establish several positive and negative results for small-world and tree-like random graph models. In 
particular, we show that small-world random graphs built from underlying grid structures do not have 
strong improvement in hyperbolicity, even when the rewiring greatly improves decentralized navigation. 
On the other hand, for a class of tree-like graphs called ringed trees that have constant hyperbolicity, 
adding random links among the leaves in a manner similar to the small-world graph constructions may 
easily destroy the hyperbolicity of the graphs, except for a class of random edges added using an expo- 
nentially decaying probability function based on the ring distance among the leaves. Our study provides 
the first significant analytical results on the hyperbolicity of a rich class of random graphs, which shed 
light on the relationship between hyperbolicity and navigability of random graphs, as well as on the 
sensitivity of hyperbolic S to noises in random graphs. 

Keywords: Complex networks, graph hyperbolicity, small-world networks, decentralized navigation 

1 Introduction 

Hyperbolicity, a property of metric spaces that generalizes the idea of Riemannian manifolds with negative 
curvature, has received considerable attention in both mathematics and computer science. When applied to 
graphs, one may think of hyperbolicity as characterizing a "soft" version of a tree — trees have hyperbolicity 
zero, and graphs that "look like" trees in terms of their metric structure have "small" hyperbolicity. Since 
trees are an important class of graphs and since tree-like graphs arise in numerous applications, the idea 
of hyperbolicity has received attention in a range of applications. For example, it has found usefulness in 
the visualization of the Internet, the Web, and other large graphs (30l [3U [36] [35j B3 ; it has been applied to 
questions of compact routing, navigation, and decentralized search in Internet graphs and small-world social 
networks lfl"3l ITOl |25l Q] [26] |H |27l ; and it has been applied to a range of other problems such as distance 
estimation, network security, sensor networks, and traffic flow and congestion minimization J2 \TT\ [TH [T9l 



[33 na. 
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The hyperbolicity of graphs are typically measured by Gromov's hyperbolic 5 fl5l 171 (see Section [2]). 
The hyperbolic 5 of a graph measures the "tree-likeness" of the graph in terms of the graph distance metric. 
It can range from up to the half of the graph diameter, with trees having 5 = 0, while "circle graphs" and 
"grid graphs" having large 5 equal to roughly half of their diameters. 

In this paper, we study the (^-hyperbolicity of families of random graphs that intuitively have some sort of 
tree-like or hierarchical structure. Our motivation comes from two angles. First, although there are a number 
of empirical studies on the hyperbolicity of real- world and random graphs (2j[T7l|33l[32l[37j[T2l, there are 
essentially no systematic analytical study on the hyperbolicity of popular random graphs. Thus, our work 
is intended to fill this gap. Second, a number of algorithmic studies show that good graph hyperbolicity 
leads to efficient distance labeling and routing schemes @ [13] ED El 1211 El, an d the routing infrastructure 
of the Internet is also empirically shown to be hyperbolic [2]. Thus, it is interesting to further investigate if 
efficient routing capability implies good graph hyperbolicity. 

To achieve our goal, we first provide fine-grained characterization of 5-hyperbolicity of graph families 
relative to the graph diameter: A family of random graphs is (a) constantly hyperbolic if their hyperbolic <5's 
are constant, regardless of the size or diameter of the graphs; (b) logarithmically (or polylogarithmically) 
hyperbolic if their hyperbolic <Ts are in the order of logarithm (or polylogarithm) of the graph diameters; (c) 
weakly hyperbolic if their hyperbolic S's grow asymptotically slower than the graph diameters; and (d) not 
hyperbolic if their hyperbolic <5's are at the same order as the graph diameters. 

We study two families of random graphs. The first family is Kleinberg's grid-based small-world random 
graphs ll22l . which build random long-range edges among pairs of nodes with probability inverse propor- 
tional to the 7-th power of the grid distance of the pairs. Kleinberg shows that when 7 equals to the grid 
dimension d, decentralized routing can be improved from 0(n) in grid to 0(polylog(n)), where n is the 
number of vertices in the graph. Contrary to the improvement in decentralized routing, we show that when 
7 = d, with high probability the small-world graph is not polylogarithmically hyperbolic. We further show 
that when < 7 < d, the random small-world graphs is not hyperbolic and when 7 > 3 and d = 1, the 
random graphs is not polylogarithmically hyperbolic. Although there still exists a gap between hyperbolic 5 
and graph diameter at the sweetspot of 7 = d, our results already indicate that long-range edges that enable 
efficient navigation do not significantly improve the hyperbolicity of the graphs. 

The second family of graphs is random ringed trees. A ringed tree is a binary tree with nodes in each 
level of the tree connected by a ring (Figure [TJd)). Ringed trees can be viewed as an idealized version of 
hierarchical structure with local peer connections, such as the Internet autonomous system (AS) topology. 
We show that ringed tree is quasi-isometric to the Poincare disk, the well known hyperbolic space repre- 
sentation, and thus it is constantly hyperbolic. We then study how random additions of long-range links on 
the leaves of a ringed tree affect the hyperbolicity of random ringed trees. Note that due to the tree base 
structure, random ringed trees allow efficient routing within 0(log n) steps using tree branches. Our results 
show that if the random long-range edges between leaves are added according to a probability function that 
decreases exponentially fast with the ring distance between leaves, then the resulting random graph is log- 
arithmically hyperbolic, but if the probability function decreases only as a power-law with ring distance, 
or based on another tree distance measure similar to |23l , the resulting random graph is not hyperbolic. 
Furthermore, if we use binary trees instead of ringed trees as base graphs, none of the above versions is 
hyperbolic. Taken together, our results indicate that 5-hyperbolicity of the graphs are quite sensitive to both 
base graph structures and probabilities of long-range connections. 

To summarize, we provide the first significant analytical results on the hyperbolicity properties of im- 
portant families of random graphs. Our results demonstrate that efficient routing performance does not 
necessarily mean good graph hyperbolicity (such as logarithmic hyperbolicity). 
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1.1 Related work 



There has been a lot of work on search and decentralized search subsequent to Kleinberg's original work (22j 
l23l . much of which has been summarized in the review l24l . In a parallel with this, there has been 
empirical and theoretical work on hyperbolicity of real-world complex networks as well as simple ran- 
dom graph models. On the empirical side, [2J showed that measurements of the Internet are negatively 
curved; ifTTl H~8l [T9l [331 1321 provided empirical evidence that randomized scale-free and Internet graphs 
are more hyperbolic than other types of random graph models; ll37ll measured the average 5 and related 
curvature to congestion; and lfT2l measured treewidth and hyperbolicity properties of the Internet. On the 
theoretical side, one has ll40l[T9l l4l[38l. The most related to our results, and to our knowledge the only prior 
work to obtain significant results for the hyperbolicity of a class of random graph models, is IT381 . which 
proves that with positive probability extremely sparse Erdos-Renyi random graphs are not <5-hyperbolic for 
any positive constant 5. 

More generally, we see two approaches connecting hyperbolicity with efficient routing in graphs. One 
approach study efficient computation of graph properties, such as diameters, centers, approximating trees, 
and packings and coverings for low hyperbolic-<5 graphs and metric spaces ||9l [8j [T3l [TOl [TTJ . In large part, 
the reason for this interest is that there are often direct consequences for navigation and routing in these 
graphs |[T3l [TUl l25l [TTl . While these results are of interest for general low hyperbolic-5 graphs, they can be 
less interesting when applied to small-world and other low-diameter random models of complex networks. 
To take one example, [9] provides a simple construction of a distance approximating tree for <5-hyperbolic 
graphs on n vertices; but the 0(log n) additive-error guarantee is clearly less interesting for models in which 
the diameter of the graph is O(logn). Unfortunately, this O(logn) arises for a very natural reason in the 
analysis, and it is nontrivial to improve it for popular tree-like complex network models. 

Another approach taken by several recent papers assumes a random graph model with small hyperbol- 
icity and then shows that such random graphs lead to several common properties of small-world complex 
networks, including good navigability properties [|5]|27]|28l|29l. While assuming low hyperbolicity in these 
studies makes intuitive sense, it is difficult to prove nontrivial results for Gromov's 5 even for simple random 
graph models that are intuitively tree-like. 

Understanding the relationship between these two approaches was one of the original motivations of 
our research. In particular, the difficulties in the above two approaches lead us to study hyperbolicity of 
small-world and tree-like random graphs. 

Finally, ideas related to hyperbolicity have been applied in numerous other networks applications, e.g., 
to problems such as distance estimation, network security, sensor networks, and traffic flow and congestion 
minimization ||4T1 l20l |2T1 [3~8l T3VT [3ll , as well as large-scale data visualization ||35l . The latter applications 
typically take important advantage of the idea that data are often hierarchical or tree-like and that there is 
"more room" in hyperbolic spaces of a given dimension than corresponding Euclidean spaces. 



Paper organization. In Section [2] we provide basic concepts and terminologies on hyperbolic spaces and 
graphs that are needed in this paper. In Sections [3] and [4] we study the hyperbolicity of small- world random 
graphs and ringed tree based random graphs. For ease of reading, in each of Sections [3] and [4] we first sum- 
marize our technical results together with their implications (Sections 3.1 and |4.1| ), then provide the outline 
of the analyses (Sections |3 . 2| and |4~2]) , followed by the detailed technical proofs (Sections |3 . 3| and |4~3]) , and 



finally discuss extensions of our results to other related models (Sections |3.4| and |4.4[ ). We discuss open 
problems and future directions related to our study in Section [5] 
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2 Preliminaries on hyperbolic spaces and graphs 

Here, we provide basic concepts concerning hyperbolic spaces and graphs used in this paper; for more 
comprehensive coverage on hyperbolic spaces, see, e.g., Q. 

2.1 Gromov's 5-hyperbolicity 

In |[T5l . Gromov defined a notion of hyperbolic metric space; and he then defined hyperbolic groups to be 
finitely generated groups with a Cay ley graph that is hyperbolic. There are several equivalent definitions (up 
to a multiplicative constant) of Gromov's hyperbolic metric space [6]. In this paper, we will mainly use the 
following. 

Definition 1 (Gromov's four-point condition). In a metric space (X, d), given u, v, w, x with d(u, v) + 
d(w,x) > d(u,x) + d(w,v) > d(u,w) + d(v,x) in X, we note 5(u,v,w,x) = (d(u,v) + d(w,x) — 
d(u, x) — d(w, v))/2. (X, d) is called 5-hyperbolic for some non-negative real number 5 if for any four 
points u, v,w,x G X, 5(u, v, w, x) < 5. Let 5(X, d) be the smallest possible value of such 5, which can 
also be defined as 5(X, d) = sup u v w xeX 5(u, v -> w i x )- 

Given an undirected, unweighted and connected graph G = (V, E), one can view it as a metric space 
(V, do), where da(u, v) denotes the (geodesic) graph distance between two vertices u and v. Then, one can 
apply the above four point condition to define its 5-hyperbolicity, which we denote 5 = 5(G) = S(V,dc) 
(and which we sometimes refer to simply as the hyperbolicity or the 5 of the graph). Trees are O-hyperbolic; 
and O-hyperbolic graphs are exactly clique trees (or called block graphs), which can be viewed as cliques 
connected in a tree fashion lfT6l . Thus, it is often helpful to view graphs with a low hyperbolic 5 as "thick- 
ened" trees, or in other words, as tree-like when viewed at large size scales. 

If we let D(G) denote the diameter of the graph G, then, by the triangle inequality, we have 5(G) < 
D(G)/2. We will use the asymptotic difference between the hyperbolicity 5(G) and the diameter D(G) to 
characterize the hyperbolicity of the graph G. 

Definition 2 (Hyperbolicity of a graph). For a family of graphs Q with diameter D(G), G G Q going to 
infinity, we say that graph family Q is constantly (resp. logarithmically, polylogarithmically, or weakly) 
hyperbolic, if 5(G) = O (I) (resp. 0(\ogD(G)), 0((\ogD(G)) c ) for some constant c > 0, or o(D(G))) 
when D(G) goes to infinity; and Q is not hyperbolic if 5(G) = @(D(G)), where G £ Q. 

The above definition provides more fine-grained characterization of hyperbolicity of graph families than one 
typically sees in the literature, which only discusses whether or not a graph family is constantly hyperbolic. 

2.2 Rips condition 

Rips condition 031 13 is a technically equivalent condition to the Gromov's four point condition up to a 
constant factor. We use the Rips condition when analyzing the ^-hyperbolicity of ringed trees. In a metric 
space (X, d), we define a geodesic segment [u, v] between two points u, v to be the image of a function p : 
[0, d(u, v)] — > [u, v] satisfying p(0) = u, p(d(u, v)) = v, d(p(s), p(t)) = \s — t\ for any s, t G [0, d(u, v)]. 
We say that a metric space is geodesic if every pair of its points has a geodesic segment, not necessarily 
unique. In a geodesic metric space (X, d), given u, v, w in X, we denote A(u, v, w) = [u, v] U [v, w] U [w, u] 
a geodesic triangle. [u,v], [v,w], [w,u] are called sides of A(u,v,w). We should note that, in general, 
geodesic segments and geodesic triangles are not unique up to their endpoints. 
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(a) Poincare disk 



(b) Tessellation 
of Poincare 
disk 



(c) Binary tree 



(d) Ringed tree 



Figure 1 : Poincare disk, its tessellation, a binary tree, and a ringed tree. 



In a metric space, it is sometimes convenient to consider distances between point sets in the following 
way. We say that a set S is within distance d to another set T if S is contained in the ball B(T, d) of all 
points within distance d to some point in T. We say that S and T are within distance d to each other if S is 
within distance d to T and vice versa. 

Definition 3 (Rips condition). A geodesic triangle A (it, v, w) in a geodesic metric space (X, d) is called 
5-slimfor some non-negative real number 5 if any point on a side is within distance 5 to the union of the 
other two sides. (X, d) is called Rips 5-hyperbolic if every geodesic triangle in (X, d) is b-slim. We denote 
$Rips(X, d) the smallest possible value of such 5 (could be infinity). 

It is known (see, e.g., lfl4l 171191') that S(X, d) and 8m ps (X, d) differ only within a multiplicative constant. 
In particular, 5(X,d) < 85ja ps (X, d) and 5m ps (X,d) < A5(X,d). Since we are only concerned with 
asymptotic growth of 6(X, d), Rips condition can be used in place of the Gromov's four point condition. 

For an undirected unweighted graph G = (V,E), we can also treat it as a geodesic metric space with 
every edge interpreted as a segment of length 1, and thus use the Rips condition to define its hyperbolicity, 
which we denote as 5ju ps (G). Note that in the case of unweighted graph, when considering the distance 
between two geodesies on the graph, we only consider the distance among the vertices, since other points 
on the edges can add at most 2 to the distance between vertices. 

2.3 Poincare disk 

The Poincare disk (see Figure [jja) for an illustration) is a well-studied hyperbolic metric space. Although 
in this paper we touch upon it only briefly when we study ringed-tree graphs, it is useful to convey intuition 
about hyperbolicity and tree-like behavior. 

Definition 4. Let D = B(0, 1) be a open disk on the complex plane with origin and radius 1, with the 
following distance function: 



(D, d) is a metric space. We call it the Poincare disk. 

Visually, a (hyperbolic) line in the Poincare disk is the segment of a circle in the disk that is perpendicular to 
the circular boundary of the disk, and thus all lines bend inward towards the origin. The hyperbolic distance 
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between two points in the disk of fixed distance in the complex plane increase exponentially fast when they 
moves towards the boundary of the disk, meaning that there is much more "space" towards the boundary 
than around the origin. This can be seen from a tessellation of the Poincare disk, as shown in Figure[jjb). 

2.4 Quasi-isometry 

Quasi-isometry, defined as follows, is a concept used to capture the large-scale similarity between two metric 
spaces. 

Definition 5 (Quasi-isometry). For two metric spaces (X,dx), (Y,dy), we say that f : X — ^ Y is a 
(A, e) -quasi-isometric embedding from XtoY if for any u,v € X, 

^d x (u,v) - c < d Y (f(u),f(v)) < Xd x (u,v) + e. 

Furthermore, if the e neighborhood of f(Y) covers X, then we say that f is a (A, e)- quasi-isometry. More- 
over, we say that X, Y are quasi-isometric if such a (A, e)-quasi-isometry exists for some constants A and e. 

If two metric spaces are quasi-isometric with some constant, then they have the same "large-scale" behavior. 
For example, the <i-dimensional grid Z d and the (i-dimensional Euclidean space M. d are quasi-isometric, 
realized by the (Vd, v / 5/2)-quasi-isometric embedding (x, y) h-> (x, y). As a second example, consider an 
infinite ringed-tree: start with a binary tree (illustrated in Figure[jjc)) and then connect all vertices at a given 
tree level into a ring. This is defined more formally in Section|4j but an example is illustrated in Figure[TJd). 
As we prove in Section [4j the infinite ringed tree is quasi-isometric to the Poincare disk — thus it may be 
equivalently viewed as a "softened" binary tree or as a "coarsened" Poincare disk. 

Quasi-isometric embeddings have the important property of preserving hyperbolicity, up to a constant 
factor, as given by the following proposition. 

Proposition 1 (Theorem 1.9, Chapter III.H of Q). Let X and X' be two metric spaces and let f : X' — > X 

be a (A, e) -quasi-isometric embedding. If X is 5-hyperbolic, then X' is 5' -hyperbolic, where 5' is a function 
of 5, X, and e. 

3 ^-hyperbolicity of grid-based small-world graphs 

In this section, we consider the 5-hyperbolicity of graphs constructed according to the small-world graph 
model as formulated by Kleinberg ll22l . in which long-range edges are added on top of a base grid, which is 
a discretization of a low-dimensional Euclidean space. 

The model starts with n vertices forming a (i-dimensional base grid (with wrap-around). More precisely, 
given positive integers n and d such that n l / d is also an integer, let B = (V, E) be the base grid, with 
V = {(xi,x 2 , ...,x d ) | Xi € {0,1,.. . ,n 1 / d — G [d]}, E = {((xi,x 2 , . . .,x d ), (2/1,2/2, • • - ,2/d)) I 
3j 6 [d], yj = x j + 1 mod u L l d or yj = x j — 1 mod n 1//rf , \fi / j, yi = x^}. Let ds denote the graph 
distance metric on the base grid B. We then build a random graph G on top of B, such that G contains 
all vertices and all edges (referred to as grid edges) of B, and for each node u € V, it has one long-range 
edge (undirected) connected to some node v S V, with probability proportional to l/dsiu^) 1 , where 
7 > is a parameter. We refer to the probability space of these random graphs as KSW(n, d, 7); and 
we let 5(KSW(n, d, 7)) denote the random variable of the hyperbolic 5 of a randomly picked graph G 
in KSW(n, d, 7). Recall that Kleinberg showed that the small-world graphs with 7 = d allow efficient 
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decentralized routing (with 0(log 2 n) routing hops in expectation), whereas graphs with 7 7^ d do not allow 
any efficient decentralized routing (with Q(n c ) routing hops for some constant c) [22]; and note that the 
base grid B has large hyperbolic 8, i.e., 8(B) = Q(n l / d ) = Q(D(B)). Intuitively, the structural reason for 
the efficient routing performance at 7 = d is that long-range edges are added "hierarchically" such that each 
node's long-range edges are nearly uniformly distributed over all "distance scales". 

3.1 Results and their implications 

The following theorem summarizes our main technical results on the hyperbolicity of small-world graphs 
for different combinations of d and 7. 

Theorem 1. With probability 1 — o(l) (when n goes to infinity), we have 

1 

1. 8(KSW(n, d, 7)) = f2((log n) i-B(d+i)+e ) when d > 1 and 7 = d, for any e > independent ofn; 

2. 8(KSW(n, d, 7)) = f2(log n) when d > 1 and < 7 < d; and 

7-2 

3. 8(KSW (n, d, 7)) = ^(m- 1 € ) w/jcti d = 1 and 7 > 3, for any e > independent of n. 

This theorem, together with the results of 1221 on the navigability of small-world graphs, have several 

implications. The first result shows that when 7 = d, with high probability the hyperbolic 8 of the small- 

1 

world graphs is at least c(log n) 1 • 5 ( d + 1 ) for some constant c. We know that the diameter is G(logre) in 
expectation when 7 = d P4l . Thus the small- world graphs at the sweetspot for efficient routing is not 
polylogarithmically hyperbolic, i.e., 8 is not 0(log c log n) -hyperbolic for any constant c > 0. However, 
there is still a gap between our lower bound the upper bound provided by the diameter, and thus it is still open 
whether small-world graphs are weakly hyperbolic or not hyperbolic. Overall, though, our result indicates 
no drastic improvement on the hyperbolicity (relative to the improvement of the diameter) for small-world 
graphs at the sweetspot (where a dramatic improvement was obtained for the efficiency of decentralized 
routing). 

The second result shows that when 7 < d, then 8 = O(logn). The diameter of the graph in this case 
is G (log n) lf34l ; thus, we see that when 7 < d the hyperbolic 8 is asymptotically the same as the diameter, 
i.e., although 8 decreases as edges are added, small-world graphs in this range are not hyperbolic. The third 
result concerns the case 7 > d, in which case the random graph degenerates towards the base grid (in the 
sense that most of all of the long-range edges are very local), which itself is not hyperbolic. For the general 
7, we show that for the case of d = 1 the hyperbolic 8 is lower bounded by a (low-degree) polynomial 
of n; this also implies that the graphs in this range are not polylogarithmically hyperbolic. Note that our 
polynomial exponent — e matches the diameter lower bound proven in ||39l . 

3.2 Outline of the proof of Theorem [I] 

In this subsection, we provide a summary of the proof of Theorem [T] In our analysis, we use two different 
techniques, one for the first two results in Theorem [T] and the other for the last result; in addition, for the 
first two results, we further divide the analysis into the two cases d > 2 and d = 1. 

When d > 2 and < 7 < d, the main idea of the proof is to pick a square grid of size £q (it does not 
matter in which dimension the square is picked from). We know that when only grid distance is considered, 

the four corners of the square grid have the Gromov 8 value equal to £q. We will show that, as long as £q is 

1 

not very large (to be exact, 0((log n) i-s(d+i)+e ) when 7 = d and 0(log n) when < 7 < d), the probability 
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that any pair of vertices on this square grid have a shortest path shorter than their grid distance after adding 
long-range edges is close to zero (as n tends to infinity). Therefore, with high probability, the four corners 
selected have Gromov S as desired in the lower bound results. 

To prove this result, we study the probability that any pair of vertices u and v at grid distance I are 
connected with a path that contains at least one long-range edge and has length at most I. We upper bound 
such ts so that this probability is close to zero. To do so, we first classify such paths into a number of 
categories, based on the pattern of paths connecting u and v: how it alternates between grid edges and 
long-range edges, and the direction on each dimension of the grid edges and long-range edges (i.e., whether 
it is the same direction as from u to v in this dimension, or the opposite direction, or no move in this 
dimension). We then bound the probability of existing a path in each category and finally bound all such 
paths in aggregate. The most difficult part of the analysis is the bounding of the probability of existing a 
path in each category. 

For the case of d = 1 and < 7 < d, the general idea is similar to the above. The difference is that we 
do not have a base square to start with. Instead, we find a base ring of length 0(^o) using one long-range 
edges eo, where £q is fixed to be the same as the case of d > 2. We show that with high probability, (a) such 
an edge eo exists, and (b) the distance of any two vertices on the ring is simply their ring distance. This is 
enough to show the lower bound on the hyperbolic S. 

For the case of 7 > 3 and d = 1, a different technique is used to prove the lower bound on hyperbolic 
5. We first show that, in this case, with high probability all long-range edges only connect two vertices with 
ring distance at most some £q = o{yfn). Next, on the one dimensional ring, we first find two vertices A 
and B at the two opposite ends on the ring. Then we argue that there must be a path V\ B that only goes 
through the clockwise side of ring from A to B, while another path Vj iB that only goes through the counter- 
clockwise side of the ring from A to B, and importantly, the shorter length of these two paths are at most 
O(£o) longer than the distance between A and B. We then pick the middle point C and D of V\ B and VJ^ B , 
respectively, and argue that the S value of the four points A, B, C, and D give the desired lower bound. 

3.3 Detailed proof of Theorem [I] 
3.3.1 The case of d > 2 and < 7 < d 

For this case, let n' = n l / d be the number of vertices on one side of the grid. For convenience, our main 
analysis for this case uses n' instead of n. 

Lemmas for calculation. We first provide a couple of lemmas used in our probability calculation. 
Lemma 1. There exists a constant c\, such that for any k,m £ Z + , we have 



E 



1 (ci lnm 



V1V2 • • • Vk m 

yi+--+vk= m 

yi ,...,y k £Z+ 

where the left side is considered to be for k > m; and for k = m = 1, the right side 0° is considered to 
bel. 
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Proof. For k = 1, it is trivial. For fc = 2, we have 



1 1 \ 2 / 1 1 

UUJ2 - " V LW2] • \m/2] + " ' + (m-l)-lj " [m/2\ \ \m/2\ + "' + T 

mm 

< ci , 

m 

where c\ is roughly 4. 

Suppose the lemma holds for k — 1, with > 3. The induction hypothesis is 



(ci lnm) 



fc-2 



, 1+ ... +Hfc _ 1=m 2/12/2 ■■■yfc-i 



■+2/fc_l=™ 
yi,---,2/fc_l€Z+ 

Since the logarithm function is increasing, we have 

1 < 1 (ciln(m- l)) fc ~ 2 1 (cilnl) fc ' 2 

■ ■ -Vk ~ 1 m — 1 m — 1 1 

i/i,...,« fc ez+ 



<(cilnm) fc - 2 - V — 

Wl+j(2=m W ^ 
I/1.H2SZ+ 

< (ci lnm) 



k _ 2 cilnm 



m 

Therefore the inequality holds for all fc. □ 

Lemma 2. For any constant wjY/j < 6* < 1, ?/jere ex/sto a constant C2 (mary depend on 9), such 

that for any constants k, n' G Z + , m G R, and non-zero Ai, A2, . . . , G R, we have 



+Afc»fc=m 2/1% ' ' " 
,l/ fc e{l,2,...,n'} 

w/iere left side is considered to be if there is no yi, 2/2, ■ • • ,Vk £ {1)2,..., n'} satisfying Ai?/i + A22/2 + 
h A fc y fe = m. 

Proo/ For each tuple (1/1,2/2, ■ ■ , 2/fc-i) G {1,2,..., n'} fe_1 , there is at most one ?/& G {1,2,..., n'} 
satisfying Aij/i + A2J/2 + ■ ■ • + A^ = m. Since < 1 because < 9 < 1, we have 



v ^ 1 1 

Ai»i+-+A fc wfc=m y l y 2 % yil ..., yfc _ l6 {i,2 ) ... 1 n'} 2/1 % Vk ^ 

1/1,- ,w fc e{l,2,...,n'} 
. 1 

where C2 is roughly (333) 1-9 - Therefore the lemma is proved. □ 
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Classification of paths. In a d-dimensional random graph KSW(n, d, 7), there are two kinds of edges: 
grid edges, which are edges on the grid, and long-range edges, which are randomly added. 

Fix two vertices u and v, a path from uto v may contain some long-range edges and some grid edges. 
We divide the path into several segments along the way from u to v: (a) each segment is either one long- 
range edge (called a long-range segment) or a batch of consecutive grid edges (called a grid segment); and 
(b) two consecutive segments cannot be both grid segments (otherwise combining them into one segment). 

We use a d-dimensional vector to denote each edge, so that the source coordinate plus this vector 
equals to the destination coordinate module n' . For grid with wrap-around, there may be multiple vec- 
tors corresponding to one edge. We choose the vector in which every element is from { — [ttJ, — LtJ + 
1, . . . , L^^^J }• m this wa Y> the vector representation of each edge is unique and the absolute value of every 
dimension is the smallest. We call this the edge vector of that edge. For every segment in the path, we call 
the summation (not module n') of all edge vectors the segment vector. For a vector (x±, X2, ■ ■ ■ , Xd), define 
its sign pattern as (sgn(xi), sgn(x2), . • • , sgn(x c ;)). 

We say two paths from u to v belong to the same category if (a) they have the same number of segments; 
(b) their corresponding segments are of the same type (long-range or grid segments); (c) for every pair of 
corresponding long-range segments in the two paths, the sign patterns of their segment vectors are the same; 
(d) for every pair of corresponding grid segments in the two paths, their segment vectors are equal; and (e) 
the summations (not module n') of all segment vectors in the two paths are equal. 

The last condition is only used to distinguish paths that go different rounds in each dimension on grid 
with wrap-around. 

In one category, there exist paths of which the long-range edges are identical but the grid edges may be 
different. To compute the probability of existing a path in a category, we only need to consider one path 
among the paths with identical long-range edges, since grid edges do not change probabilistic events and 
thus one such path exists if any only if other such paths exist. 

We also assume that there are no repeated long-range edges in every path. For a path that has repeated 
long-range edges, we can obtain a shorter subpath without any repeated long-range edges so that the original 
path exists if and only if the new one exists. Since we are going to calculate the probability about paths not 
exceeding some length, it is safe to only consider paths without repeated long-range edges. 

Lemma 3. There exists a constant C3 (dependent on d) such that for any fixed £, the number of categories 
of paths from u to v of length £ is at most c%. 

Proof. For each edge on the path, if it is a grid edge, it could be in one of the d dimensions, and in each 
dimension it could be in one of the two opposite directions, and thus a grid edge has 2d possibilities. If 
the edge is a long-range edge, on each dimension its sign has three possibilities (+1, 0, —1), so totally 3 d 
possibilities for the sign pattern of the long-range segment vector. Moreover, in the grid with wrap-around, 
we consider each wrap-around of the path on some dimension to be one round in that dimension. Then the 
path can go at most 2£ + 1 different numbers of rounds on each dimension (ranging from £ rounds in one 
direction up to £ rounds in the other direction), so the summation of all segment vectors has at most (2£ + l) d 
different values. The choice of each edge out of 2d + 3 d possibilities and the total summation of segments 
vectors determine a category. Therefore, the number of categories is bounded by (2d + 3 d ) e (2£ + l) d < c% 
for some C3. □ 

The above bound on the number of categories is not tight enough to be used for later analysis, when 
the number of long-range segments are small. Thus, we further bound the number of categories in the 
following way. 
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Lemma 4. There exists a constant C4 (dependent on d), such that for any fixed £ < n' and k with 1 < k < £, 
the number of categories of paths from u to v of length £ and having k long-range segments is at most 



Proof. For a path from u to v of length I and containing k long-range segments (1 < k < £ < n'), the 
summation of all segment vectors has at most (2k + l) d choices. This is because the path can go at most 
2k + 1 different number of rounds on each dimension (k rounds in one direction to k rounds in the other 
direction). We consider the number of categories for a fixed summation of segment vectors first. 

Suppose there are t grid segments, each having ai, 02, . . . , a* edges respectively (t < k + 1, ctj > 1, 
«i + 0,2 + ■ ■ ■ + at < £). If t = 0, then k = £, and it is easy to see that there are at most (3 d ) k categories. 
Suppose now t > 1. For the 2-th grid segment with ai grid edges, its segment vector is such that on each 
dimension the only possible values are — ai, —ai + 1, . . . , 0, . . . , a, — 1, a,. Thus, the number of possible 
segment vectors is (2<n + l) d < 3 d af. Since each long-range edge has 3 d possible sign patterns, the number 
of categories for fixed t and ai, a 2 , . . . , a t is at most (3 d ) k Ylt=i 3 d af < 9 ( - k+1 ^ d (£/t) td , where the inequality 
of arithmetic and geometry means is used. 

The tuple (ai, 02, ... , ai) has less than £ l possibilities. Considering (2k + \) d different possibilities 
of segment vectors summations, the total number of categories from utov with length £ and containing k 
long-range edges is at most 



Probability calculation. We first give a lemma to calculate the probability of the existence of a specific 
edge. For an integer x, we define x to be |x| if x 7^ and 1 if x = 0. We also define function f(n') as 
follows: 



Lemma 5. For two vertices u and v, the probability of the existence of a long-range undirected edge between 
u and v is at most c^(x\ ■ x~2~- ■ •x~2)~d/f(n'), where C5 is a constant depending only on d and 7, and 
(x±, X2, • • • , Xd) is the edge vector if there exists a long-range edge from u to v and depends only on u 
and v. 





where C4 is a constant depending only on d. 



□ 




Proof. Say the non-zero elements of (a?i, X2, ■ ■ ■ , Xd) are (xj i; xi 



',' < d). Let p be the proba- 
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bility to add an edge from u to v. Then 

Oil + \x 2 \ h — h \xd\y 1 



p 



< o 




Moreover, the edge may also be from v to u, which also has probability p. By union bound the proba- 
bility of the undirected edge (u, v) is 0(p). □ 

The following lemma gives the probability that one edge jumps within a local area. 

Lemma 6. For a vertex u and a long-range edge (u, v)from u, the probability that the grid distance between 
u, v is less than s is at most CQf(s) / f(n'), where cq is a constant depending only on d and 7. 

Proof. By union bound, the probability is at most 



s 



*-T \ ^ /Ai^-T] f(s) 



where cq is a constant. □ 

Given a path category C, we now calculate the probability of existing a path in C. 

Lemma 7. Given a path category C with length £ and k long-range edges. The probability that there exists 
a path in C is at most 

4(c*k k ) d I \lnn') k 7 = d, 

c , c ( fc _ 1)(d _ 7) / (n , )d - 7 o < 7 < d; 

where c 2 and C5 are the constants given in Lemma^and cj is another constant. 
Proof. Let the segment vectors of the long-range edges in a path P G C be 

(Xn, X 12 , • • • , Xld), (x 2 l,X 22 , • • • , X 2 d), • • • , (sCfcl, X k 2, ■■ ■ , Xkd)- 

By our definition of a category, all paths in C have the same sign patterns on the corresponding long-range 
segment vectors. Thus, we can define the following sets for the category C: Af = {j \ Xji > 0, 1 < j < k}, 
A~ = {j I Xji < 0, 1 < j < k}, and A® = {j \ Xji = 0, 1 < j < k}, for all 1 < i < d. For fixed u and 
v, there is a fixed vector (ti, t 2 , ■ ■ ■ , td) such that the summation of the segment vectors of all long-range 
segments in any P G C is vector {t\,t 2 , . . . ,td). This is because the summation of all segment vectors 
from u to v is fixed, and all grid segments have the fixed segment vectors. Therefore, a path in C can be 
characterized by kd integers in, . . . , Xkd satisfying the following for all 1 < i < d: 

( Xji G {l,...,n} for j G Af , 

Xji£ {-n',...,-l} for j£A^, 

forj G 4°, (1) 



\ X ji\ 1^*1 ~~ ^' 



v jeA: 
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The probability that some path exists is the multiplication of the probability of the first edge, the probability 
of the second edge conditioned on the existence of the first edge, the probability of the third edge conditioned 
on the existence of the first two edge, etc. In our model, the probability of an (undirected) edge conditioned 
on the existence of other (undirected) edges is less than or equal to the probability without condition, because 
each vertex can only connect to exact one other vertex (when considering the edge direction). Hence we can 
use the multiplication of the probabilities of all edges as an upper bound of the probability of a path. By 
union bound and Lemma [5] the probability that a path exists in C is at most 



E n 

all paths in C j=l 



C 5 (Xji ■ Xj 2 ■ ■■Xjd)' 

fin') 



< 



f k (n') 



/*(w 



satisfying U] 



8=1 



E 



^ forjgA?, SjJ= 1; for je AT UAT , xj^e{l, — ,n'} 



— -5 — . TT 

nm) 11 



E 



xjl (J ranges in AT UA. ): 



i =t i ;anda: J - i e{l,...,ii'} 



i&AT j& a: 



k 

i=i 



n 



(2) 



The inner brace summation is considered to be 1 for Af = A i =0. Now we consider the following sum 
for disjoint sets A + , A~ C {1, 2, . . . , k} (at least one is not empty) and numbers t G Z, 7 > 0, d > 0, 



E 



yj {j ranges in A^UA ) : 
E Vj- E yj=t;andj/ J -6{l,2,...,n'} 
j6A+ ieA- 




(3) 



Case 7 = d. 

If one of A + and is 0, we only need to consider the case that t 0, because the sum (D is for 
i = 0. By Lemma[TJ the sum Q is bounded by 



C ?^<^max { ^} = (c lr c/ e ) fc . 

\t\ x>l X 



If neither A + nor A is 0, we assume that t > 0. (The calculation for i < is similar if we exchange 
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A + and A .) By Lemma[T]the sum (|3]> is bounded by 

\ / 



1 



fen' / 



E 

E %=s+t 



n w 



=1 

fen' 



(ln(s + t)) fc 
i »(«+*) 

(ln(s + t)) fc 



< cfE 

8=1 

fen' 

= 

s=l x 

< c\ • max{ ^fl } • V 



In either case, the sum ^ is bounded by Cjk k for some constant cj. By (|2j) the probability of a path 
in C is at most c|(cf k k y r /(lnn') k . 

Case < 7 < d. By Lemma |2| the sum Q is bounded by 

(c 2 n') (|A+l+|A_| - 1)(1 -i ) < (c 2 n') (fc - 1)(1 -2). 
Therefore, by (|2]) the probability of a path in C is at most 



( n /)fe(d- 7 ) 



(c2n ) 



- _ 4c 2 fc " 1)(d - 7) 



(n') 



Ad-7 



□ 



Finally, we apply Lemmas [3] |4| [6j and [7] together to show that the probability of any two vertices 
connected by a short path with at least one long-range edge is diminishingly small. 

Lemma 8. For any two vertices u and v, the probability that a path exists connecting u and v with at least 

i 

one long-range edge and total length at most £ is o(l), when I < (log n') i-s(d+i)+e (e > O)for 7 = d; and 
I < c log n' ( c is some constant depending only on d and 7) for < 7 < d. 

Proof. We first study the probability of a path with exact length I, and we divide it into the following two 
cases. 

• Case 7 = d. If k = 1, the probability that a path from u to v with length £ exists is at most 
0(£ d CQ ln(3£) I In n') = 0(£ d ln£/ In n'). To see this, we divide the path into three segments: a grid 
segment followed by one long-range edge, then followed by another grid segments. The first grid 
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segment can reach at most 0(1 ) destinations. For each of this destination w, the long-range edge 
has to reach some vertex within grid distance £ of vertex v. And by triangle inequality, it must be 
within grid distance 3£ of vertex w since the distances between v, u and u, w are both at most L By 
Lemma|6j we know this probability is c% ln(3£) / In n' . Therefore, the above statement holds. 

For k > 2, we simply combine Lemmas [4] and [7] Thus, the probability that a path with length £ exists 
is at most 

\ In n' I 




^ ( Inn' ) 



Inn' J ^— ' \ Inn' / 

i 

This probability is o(l) when £ < (log re') i-5(«n-i)+s for any e > 0. 

• Case < 7 < d. In this case, we combine Lemmas [3] and [7] The probability that a path with length £ 
exists is at most 

C3 • max < — ; — — i > <- — - — jr-j , 

i<k<l y (n') d ~ 7 J ~ (n') d_7 

where the inequality is based on 02,05 > 1, which is obviously the case. This probability is o(l) when 
£ < c log n' for some properly chosen constant c, which depends only on d and 7. 

We now consider the case of path length less than £. Let wbea grid neighbor of v. For any path 
connecting u and v with length less than £, we can add grid edges (v, w) followed by (w, v), to increase the 
path length to either £ — 1 or £. Thus, the probability that a path of length at most £ exists is the same as the 
probability that a path of length £ — 1 or £ exists. The case of £ — 1 follows exactly the same as the case of £ 
argued above. Therefore, the lemma holds. □ 

With Lemma [8] we are ready to prove Theorem [T] for the case of d > 2 and < 7 < d. 

Proof of Theorem^for the case of d > 2 and < 7 < d. We define £ = [(log n') 1 -^ d + 1 )+' /2\ when 7 = 
d, and £ = [clogn'/2j when < 7 < d, where c is the constant determined by Lemma[8] In the base grid, 
find any square in any dimension with one side length equal to £. Consider the four corner vertices of the 
square. By Lemma[8] the probability that a pair of corners of this square are connected by a path with at least 
one long-range edge and total length at most 2£ is o(l). Thus by union bound, the distances between any 
pair of these four corner vertices are exactly their grid distance, with probability 1 — o(l). This means that 
the S value of these four vertices is £, with probability 1 — o(l). Therefore, we know that with probability 

1 - o(l), S(KSW(n,d,j)) = ^((logn') 1 - 5 ^ 1 ^) = ft ((log n) ) when d > 2 and 7 = d, and 

5(KSW(n, d, 7)) = ft(logn') = ft(logn) when d > 2 and < 7 < d. □ 



Remark (The limitation of this approach). We already have tight lower bound for < 7 < d. However, 

1 

the lower bound ft((logn) 1 - 5 ( d+1 )+ e ) for 7 = d is still not matching the upper bound O(logn). We show 
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below that the lower bound can never be improved to f2(logn) by the above technique of proving that the 
grid distance is the shortest on every small square (with high probability). 

Consider a KSW graph with 7 = d. For an £ x I square S, let u be its the upper left vertex and v be 
its lower right vertex. Similar to the proof in Lemma [6] the probability of existence of an edge linking any 
w into the 1/2 x 1/2 square on the lower right side of w within the square S (but excluding w's two grid 
neighbors in the square) is 

Consider the i/2 x 1/2 square on the lower right side of u, which is the upper left quadrant of the original 
square. The probability that at least one vertex w in this quadrant links to its lower right 1/2 x 1/2 square 

is 1 - (1 - q y/ 2x£ / 2 . This probability is almost 1 when £ = u (y ^ogiog!' ) for d = 2, or I = u^^log^ 7 ) 
for d > 3. If there exists such a vertex w in the upper right quadrant, and suppose it links to a vertex x in its 
lower right 1/2 x 1/2 square. Then x must also be in the original square S, and we can have a path from u 
to w following the grid path, then the long-range edge from w to x, and then the grid path from x to v. This 
path connects u and v and must be shorter than the grid paths from u to v. Therefore our technique cannot 

improve the lower bound to u (^/ i gk>gn' ) = u (\ Aogk>gn ) for d = 2 01 ^(V^grH) = uj^logn) for 
d > 3. 



3.3.2 The case of d = 1 and < 7 < 1 

In this section we give lower bounds of 5 for the one dimensional KSW model (based on an n- vertex ring). 

1 

Let the n vertices be vq, u n -i- Let ^0 = [(logra) 15(ti+1)+e J when 7 = 1, or [clognj when < 7 < 1, 
where c is the number determined in Lemma [8] 

The idea is to find a long-range edge eo between two vertices with grid distance Iq. As eo forms a local 
ring with the original grid, we can give a lower bound of 5 such that the ring distances (with respect to the 
grid edges and eo) are the shortest even after adding the long-range edges. We first calculate the probability 
of ring distances being the shortest under the condition of existing eo- 

We divide the construction of a KSW graph into two stages: 1) Every vertex links to exactly one other 
vertex according to some distribution; and 2) we ignore the edge direction and consider the graph as undi- 
rected. Let £i (0 < i < n — 1) be the event that v% links to V( i+ £ mo( j n ) in the first stage. Under the condition 
that £i happens, V( i+ e mQ( j n \ is still free to link to any vertex. The events £\, . . . , £ n -\ are independent. 

Lemma 9. Under the condition that £{ happens (vi links to vu + t mo( j n \), for any two vertices u, w on the 
curve from Vi to v^ + £ moc j n \ (exclusive), the conditional probability of existing a path connecting u and w 
with at least one long-range edge other than (vi,V( i+ £ m0( j „)) and path length at most £q is o(l). 

Proof. Let eo be the edge (v{, i>(i+£ mod n))- We still follow the arguments for d > 2 and < 7 < d, but 
change the classification of paths slightly: now eo is a type of edges by itself, and thus together with grid 
edges and long-range edges, we have three types of edges and three types of corresponding segments. For 
each original category of paths with length £ and k long-range segments defined in the previous section, 
we further divide it into at most k + 1 categories, based on whether eo was the first, second, or the k-th 
long-range segment, or eo does not appear in the path. The number of long-range edges in a category with 
eo is decreased by 1. For the categories with only grid edges and eo, those paths now have no long-range 
edge and they exist with conditional probability 1 , hence they will not be calculated. 
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For Lemma [3} the number of categories is increased by a factor of at most I + 1. For Lemma [4] the 
number of categories is increased by a factor of at most k + 1. Thus, we only need to properly adjust the 
constants C3 and C4 in these two lemmas to make them still hold. Lemma [5] is not changed except that eo 
is no longer a long-range edge and the lemma is not applicable on eo. Lemma [6] is not affected. In the 
proof of Lemma [7] a category without eo can be calculated normally. For a category with eo, the number of 
long-range edges is decreased by 1 as stated above. One can see that all the arguments still hold by changing 
the summation of long-range edges (tj in ([T}) to contain eo- 

For Lemma[8j only the case 7 = d and k = 1 (one long-range edge) needs some modification. If the path 
does not contain eo, the calculation still holds. Otherwise, we can divide the path into five segments: grid, 
long-range, grid, eo, grid (or grid, eo, grid, long-range, grid). We just consider the 3 consecutive segments 
grid, eo, grid as a whole, which contains 0{i) edges and can reach 0(£ d ) = 0(£) destinations. One can see 
the previous argument still holds. Therefore, the consequence of Lemma[8]still holds. □ 

With Lemma [9j we give the proof of Theorem [T] for the case of d = 1 and < 7 < 1. 

Proof of Theorem^for the case of d = 1 and < 7 < 1. Let T be the event 5 > \t§ — 3. We first show 
that Pr{^ I Si} = 1 - o(l). Pick four vertices A { = v {i+[ i eoj modn) , B t = w (i+L | /oJ modn) , Q = 
f (i+LfftJ mod n) and D i = v (i+llto\ mod «)• B Y Lemma|9]and union bound, we know that with probability 
1 — o(l), the distances between every pair of vertices are the ring (grid edges plus (vj, vi i+ £ moc j „))) dis- 
tances. That is, the distances between AiBi, B; t Ci, CiDi, DiAi are roughly \Iq (off by at most 2), and the 
distances between AiCi and B{Di are roughly (off by at most 1). Therefore, by considering the four 
vertices AiBidDi, we have 5 at least \ — 1 + \io — 1 — \lo — 2 — — 2) = \£q — 3 with conditional 
probability 1 — o(l). 

For every £i, the probability that v% links to V( i+ £ mo d n) i s 



2+s . 



Pr{£:.i = G(V-)= @((l0gnr ^ ) 7 = 1 ' 

7(«) [eilogn/n 1 -') 0<7<1. 

Define this probability as q. We have PrjJ 7 and £i} = PrjJ 7 | £i} Pr{£i} = (1 — o(l))q. 

Let K be the random variable denoting the number of £^s that occur. We define m = E[K], and 
we have m = nq. One can check that m = poly(n) for both cases 7 = 1 and < 7 < 1. By 
Chernoff Bound, Pr{\K - m\ < m a6 } > 1 - 2e- { - m '" A ^ = 1 - 2 e - m °' 2 / 4 . Hence with very 
high probability, K is close to m. Let Q denote the event that m — m 6 < K < m + m 6 . We show 
that PrjJ 7 and £i and Q} is very close to PrjJ 7 and £{]. In fact, PrjJ 7 and £i and Q} > PrjJ 7 and £1} — 
Pr{not G] = (1 - o(l)) Pr{J" and £ { }, because Pr{not Q} < 2e~ m ° ' 2 / 4 = 2e~P ol y^ is much smaller than 
PrjJ 7 and £{} = (1 — o(l))q = uj{\/n). On the other hand, it is straightforward that Pr{ F and £{ and Q} < 
PrjJ 7 and Hence PrjJ 7 and £i and Q} differs from PrjJ 7 and £{} by a factor at most (1 — o(l)), and 
Prjj 7 and £ { and 9} = (1 - o(l))q. 

For every r = r$r\ ■ ■ ■ r n _i € {0, l} n , define to be the event that £j happens iff rj = 1 for all 
j = 0, 1, . . . , n — 1. We can see % r 's are mutually exclusive for r G {0, l} n . Hence 

m+m 0,6 

Pr{ J 7 and £ t and = ^ ^ Pr^and^}. 

fc=m-m - 6 r G {0,i}« 

rA—1 and r has fc l's 
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Therefore 



m+rrt 0,6 

Pr{ T and G} = PrjJ 7 and H r } 



k=m-m - 6 re{o,i}" 

r has k I's 



m+m 0-6 n—1 

= E r-E E Pr^andW,} 



k 

k=m-m - 6 i=0 re{o,i}" 

Tj — 1 and r has k I's 



n—1 m+m 0,6 

>^M'E E E P^andH,} 



m + mr 

i=0 fc=m.- m 0.6 re{0,l}« 

r,- — 1 and r has fc I's 



n-1 

m + m r " 



l-^^Pr{J- and ^ and fi?} 

8=0 

n(l-o(l))g m(l-o(l)) 



l-o(l). 



m + m - 6 m + m - 6 
The probability that S > \i Q - 3 = 0(4) is Pr{ J 7 } > Pr{ J" and Q} = 1 - o(l). □ 

3.3.3 The case of d = 1 and 7 > 3 

We first show that with high probability all long-range edges connect two vertices with grid distance o(n), 
for general d and 7 > 2d. 

Lemma 10. In a random graph from KSW(n,d,^) with 7 > 2d, with probability 1 — o(l) there is no 

long-range edge that connects two vertices with grid distance larger than n~'- d+£ , where e is any positive 
number. 

Proof. For a vertex u, the probability that the long-range edge from u links to somewhere with distance 

1 I c 

longer than £q = ni- d from u is 

(n 1 / d _ \ ( 00 \ 

i=e l~>3=\ 3 J '/ \i=e j 
By union bound, the probability that such u exists is 



O 



(nl d -^ =o(n- n ( ^ +£)(d - 7) ) = O (n< d ~^ = o(l). 



Therefore with probability 1 — o(l) such u does not exist. □ 

Given a graph G in KSW(n, d, 7), let £q(G) be the largest grid distance of two vertices connected by a 
long-range edge in G. From the above result, we know that when 7 > 3d, £o(G) < n 7 i^ d+£ = o(V n l l d ) 
with high probability. For the rest of this section, with d = 1, we fix G to be any graph in KSW(n, 1,7) 

with £q(G) < n^ I+£ , and show that 5(G) = $7(n c ) for some constant c. Since G is fixed, we will use £q 
to be the short hand of £q(G). 
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Now go back to the one dimensional grid with wrap-around, which is a ring with vertices vq , v\ , . . . , v n -\. 



Notice that in one-dimensional case, the edge vector defined in Section 3.3.1 degenerates to a scalar value 
from { — [f J , — |_f J +!)•••) L^i^J }■ We arrange vq,v\, .. . , v n -\ clockwise on the ring. Then a pos- 
itive edge scalar corresponds to a clockwise hop while a negative edge scalar corresponds to a counter- 
clockwise hop. 

Let A = vq and B = i>i n /2j be two specific vertices. We define two kinds of paths between A and B: 
a positive path is one in which the summation of edge scalars (not taking module n) is positive, while a 
negative path is one in which the summation of edge scalars (not taking module n) is negative. 

Lemma 11. There exists a positive path from AtoB that does not go through v^ n /2\+i, v\n/2\+2> ■ ■ ■ > v n-i, 
and the length is at most 2£q longer than the shortest positive path from A to B. Similarly, there exists a 
negative path from A to B that does not go through vi, v 2 , • • • , v \ n /2\—h an d the length is at most 2£q longer 
than the shortest negative path from A to B. 

Proof. We only give the proof for the positive path case. Consider any shortest positive path V from A to B. 
We first show the following claim. Let Sa = • • • > ve Q } be the set of £q consecutive vertices clockwise to 
A, and Sb = {i>i n /2j-4, ^[n/2J-4)+i> ' ' ' ' v Yn/2\-i\ be the set of £q consecutive vertices counter-clockwise 
toB. 

Claim. There must exist a subpath in V from a vertex u E Sa to a vertex w G Sb that does not go 
through v [n/2i+1 , v [n/2i+ 2, . . . ,v n -i. 

Say there are m edges in V. Let Si (0 < i < m) denote the summation of the first i edge scalars in 
V (not taking module n). Initially we have so = 0, and the final value s m is kn + [n/2\ for some integer 
k > 0. One can also see that the position after going through the first i edges in V is at vr s . mo( j n y Let i\ 
be the smallest integer such that > [n/2\ (exist because s m > [n/2\ ), and i 2 be the largest integer such 
that i 2 < h and Sj 2 < (exist because so = 0). 

We consider the (i 2 + l)-th edge, which begins at V( s mod n ) for some Sj 2 < 0, and ends at v ( Si +1 modn) 
for some Sj 2+ i > 0. The number Sj 2+ i is at most Sj 2 + £q since no edge is longer than £$. Therefore, 
Si 2 +i must be a number in (0, £q] and «( s<2+1 mod n) must be in Sa- We choose u = W( s<a+1 mod n) £ Sa- 
Similarly, pick w = s^-i, which is the beginning point of the ii-th edge, we have w G Sb- The intermediate 
values Sj 2+ i, Sj 2+ 2, • • • , s^-i are all in the interval (0, L n /2j) by the definitions of i\ and i 2 . That is, for 
all j such that i 2 + 1 < j < i\ — 1, Sj mod n = sj, and the corresponding vertex V( Sj mo dn) = v Sj G 
{vi, v 2 , . . . , ul„/2J-i}- Therefore the subpath from utow does not go through ui n /2j+i, ^L™/ 2 J+2' ■ ■ ■ > w n-i. 
and the claim holds. 

With the claim, we can construct a positive path V' , which use ring edges from A to u, then use the 
subpath in the claim from u to w, and then from w to B using ring edges. The length of V 1 is at most 2£§ 
longer than V, the shortest positive path from A to B. □ 

We use V\ B and V~ AB to denote the two paths stated in the above lemma. According to this lemma, one 
of V\ B and V^ B is at most 2£q longer than the shortest path between A and B. 

Let C be the middle point of V\ B (take a vertex nearest middle if the path has odd number of edges), 
and V\ c , Vq B be the two subpaths from A to C and C to B. Similarly, let D be the middle point of V~a b 
and Pad' ^db b e the two subpaths. 

Lemma 12. The paths V\ c , Vq B , V^ d and Vf )B are at most 3£o + 1 longer than the shortest paths between 
corresponding pairs of vertices. 
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Proof. We only give the proof for V\ c . Suppose that it is not true, and the shortest path V* AC from A to C 
is at least 3/o + 2 shorter than V\ c . The path V* AC must be at least 3£o + 1 shorter than Vq b because C is 
the point nearest middle of V\ B . 

Consider the last time that the path V AC gets into the range {vq, v\, . . . , v\n/2\ }» the subpath of V* AC 
from that point to C must be one of the following cases. 

• It is a subpath from a vertex A! G {vq,v\, . . . ,vg } to C not going through ui n /2j+i, ^|n/ 2 J+ 2 ' 
. . . ,v n -\. Replace V\ c by the path from A to A' through ring edges concatenated with the subpath 
of V AC from A 1 to C. This will cause the length of V AB to decrease by at least 3£o + 2 — £o > 2£o, 
which is impossible by Lemma [TT] 

• It is a subpath from a vertex B' £ {v [ n /2\-e , W|n/2j-4+i) • • • > w |n/2j} t0 @ mat ^ oes not §° through 
wi n /2j+i> u |n/2j+2> •••>^n-i- Replace Vq B by the reverse of this subpath of V* AC from C to 2?' 
concatenated with ring edges from B' to B. This will cause the length of V AB to decrease by at least 



3£o + 1 — 4> > 2^o, which is impossible by Lemma [TT 
Therefore the lemma holds. □ 
Then we consider the shortest path between C and D. 

Lemma 13. Either the concatenation ofV^, B and reversed V BB , or the concatenation of reversed V AC and 
V BA is at most 8£q + 2 longer than the shortest path between C and D. 

Proof. The shortest path from C to D (say Vq D ) must go through either B's neighborhood V[ n / 2 \ , ^|n/2j+i» 
V[ n /2\+e Q or A's neighborhood vo,vi, . . . ,vi . Without loss of generality, we assume that it goes 
through the point B' in B's neighborhood. Use Vq B , and V B , D to denote the two subpaths from C to 
B' and B' to D respectively. They must also be shortest paths of CB' and B'D. 

The path Vq B , is at most £q shorter than the shortest path between C and B, otherwise the path Vq B , 
concatenated with ring edges from B' to B would be shorter than the shortest path. Similarly, V* B , D is at 
most £q shorter than the shortest path between B and D. Therefore the shortest path between C and D 
is at most 2£$ shorter than the concatenation of shortest paths of CB and BD. Then by Lemma [T2j the 
summation of Vq B and V D \ B is at most 2(3^o + 1) + 2£o = 8£o + 2 longer than the shortest path between 
CandD. ' □ 

We have the following corollary since the two paths in this lemma differ by at most 2 considering the 
length. 

Corollary 14. The concatenation ofV^ B and reversed V BB , and the concatenation of reversed V AC and 
V BA are both at most S£q + 4 longer than the shortest path between C and D. 

Now we can prove the lower bound of 5 for this case. 

Proof of Theorem^for the case of d = 1 and 7 > 3. Consider the four points A, B, C and D defined above. 
Let d(x, y) denote the distance between vertices x and y. We can see the following consequences about pair- 
wise distances: (a)d(A,B) > [n/2\/£ ;(b)d{C,D) > \V^ B \ + \V^ B \-{8e + 4) > \Vac\- 1 + \' P db\- 
(8£ + 4) > d(A, C) + d(D, B) - (8£ + 5), where the first inequality is due to Corollary [H] and (c) simi- 
larly, d(C, D) > d(A,D) + d(C,B)-(8£ + 5). Therefore, we have both d(A, B) + d(C, D) > d(A,C) + 
d(D, B) + [n/2\ /£ - (8£ + 5), and d(A, B) + d(C, D) > d(A, D) + d(C, B) + [n/2\ /£ - {8£ + 5). 
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1 I 

For £q < jit- 1 with any sufficiently small e > and sufficiently large n, we have |_rc/2j/% >> 
8£ + 5, and thus d(A, B) + d(C, D) is the largest distance pair. In this case, 5 > [n/2\ /£ - (8£ + 5). 
By Lemma liol with probability 1 — o(l) there is £q < n^~ I+£ . Therefore, with probability 1 — o(l), 

5(i<f,SW(n, 1,7)) = ^(n/nT- 1 ) = ^(nT- 1 ) for d = 1, 7 > 3 and any sufficiently small e > 0. Since 

7-2 E 7-2 c / 7-2 

for any e > e > 0, n - '- 1 = ^(ni- 1 ), we have 8(KSW(n, 1, 7)) = llfni- 1 ) for any e > 0. □ 



3.4 Extensions of the KSW model 



Our analysis also holds for some variants of the KSW model. In this section, we study one variant of the 
underlying structure: grid without wrap-around; and two variants of edge linking: multiple edges for each 
vertex and linking edges independently. 

Grid without wrap-around. We modify our analysis so that the first two results of Theorem [T] still hold. 
For the case of d > 2 and < 7 < d (Section 3.3.1 1, the changes are as follows. For a path from u to v, 
we divide it into segments as before. Elements in an edge vector are in {— n', —n' + 1, . . . , n' — 1, n'} now. 
Recall the last condition that we define two paths from u to v belong to the same category: the summations 
(not module n') of all segment vectors in the two paths are equal. This is always satisfied for grid without 
wrap-around, because the summation of all segment vectors depends only on the positions of u and v. In 
the proofs of Lemma[3]and Lemma[4j the summation of all segment vectors is fixed rather than (2£ + l) d or 
(2k + l) d choices respectively. Hence the upper bounds given in Lemmas [3] and [4] still hold. In Lemma[5] 
an edge between u and v can be from u to v or from v to u. The probabilities of the two cases may differ 
by a constant factor on grid without wrap-around. Hence the probability of existing an edge between u 
and v is changed by at most a constant factor, and Lemma [5] still holds. One can verify the rest analysis in 
Section 3.3.1 still hold. For the case of d = 1 and < 7 < 1 (Section 3.3.2 1, the only change is that event 
£ i, which is the event that V{ links to v i+ i , only applies when i = 0, 1, . . . , n — £q — 1. Since £q = 0(log n) 
is much smaller than n, there are still almost n events £i and the argument has no significant change. Hence 
Theorem [T] still holds for the cases that d > 1 and < 7 < d. 



Multiple edges for each vertex. In this model, each vertex links a constant, say do, number of edges 
according to the same distribution that u links to v with probability ^^ds'iu v')~i • ^ e snow tri at all our 
analysis still hold with some slight changes. For the case of d > 2 and 1 < 7 < d (Section 3.3. 1 ), Lemma[5] 
still holds since the probability of the edge (u, v) is increased by at most c?o times using union bound. And 
the proof of Lemma 6 still works, because 0(i d_1 ) is an upper bound of the probability that u links to 
some vertex at distance i by union bound. For the case of d = 1 and < 7 < 1 (Section [332] ), we define 

£i as the event that at least one of u^'s edges links to V( i+ £ mod n y One can see Pr{£j} is still 0( j^y) and 
the rest argument also holds. For the case of d 

" ' that 0{i d " 1 



the same reason as Lemma 



1 and 7 > 3 (Section 3.3.3 1, Lemma 10 still holds for 



2^=1 J 



is an upper bound of the probability that u links to 



some vertex at distance i. One can verify all results of Theorem [T] still hold under this change. 
Linking edges independently. In this model, all edges exist independently. The edge between (u, v) 



exists with probability 



d -d B (u,v) 7 



■d-l-j 



, where do is some constant. We also give the changes in our analysis. 



Lemma [5] is straightforward in this model. Lemmas [6] and 10 still hold for the same reason as above. In 
the proof of Lemma [7J the first line of Eq (|2]), which uses the multiplication of edges' probabilities for an 
upper bound of the path's probability, still holds because it is now just the multiplication of independent 
events. For the case of d = 1 and < 7 < 1 (Section 3.3.2 1, we define £j to be the event that the edge 
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(vi,V(i + £ mo d n )) exists, one can see the analysis still works. All results of Theorem [T] still hold under this 
change. 

In summary, Theorem [T] of the case d > 1 and < 7 < d still holds for grid without wrap-around, 
and Theorem[T]of all cases still holds for both variants of edge linking. The variants of edge linking can be 
combined with grid without wrap-around, for which Theorem[T]of the case d > 1 and < 7 < d still holds. 



4 £-hyperbolicity of ringed trees 

In this section, we consider the <5-hyperbolicity of graphs constructed according to a variant of the small- 
world graph model, in which long-rang edges are added on top of a base graph that is a binary tree or 
tree-like low-5 graph. In particular, we will analyze the effect on the 5-hyperbolicity of adding long-range 
links to a ringed tree base graph; and then we will consider several related extensions, including an extension 
to the binary tree. 

Definition 6 (Ringed tree). A ringed tree of level k, denoted RT(k), is a fully binary tree with k levels 
(counting the root as a level), in which all vertices at the same level are connected by a ring. More precisely, 
we can use a binary string to represent each vertex in the tree, such that the root (at level 0) is represented 
by an empty string, and the left child and the right child of a vertex with string a are represented as aO 
and oT, respectively. Then, at each level i = 1, 2, . . . , k — 1, we connect two vertices u and v represented 
by binary strings a u and a v if (a u + 1) mod 2* = a v , where the addition treats the binary strings as the 
integers they represent. As a convention, we say that a level is higher if it has a smaller level number and 
thus is closer to the root. 

Figure[jjd) illustrates the ringed tree RT(6). Note that the diameter of the ringed tree RT(k) is ©(logn), 
where n = 2 k — 1 is the number of vertices in RT(k), and we will use RT (00) to denote the infinite ringed 
tree when k in RT(k) goes to infinity. Thus, a ringed tree may be thought of as a soft version of a binary tree; 
and to some extent, one can view a ringed tree as an idealized picture reflecting the hierarchical structure 
in real networks coupled with local neighborhood connections, such as Internet autonomous system (AS) 
networks, which has both a hierarchical structure of different level of AS'es, and peer connections based on 
geographical proximity. 



4.1 Results and their implications 

A visual comparison of the ringed tree of Figure [jjd) with the tessellation of Poincare disk (Figure [jjb)) 
suggests that the ringed tree can been seen as an approximate tessellation or coarsening of the Poincare disk. 
Our first result in this section makes this precise; in particular, we show that the infinite ringed tree and the 
Poincare disk are quasi-isometric. 

Theorem 2. The infinite ringed tree RT(oo) and the Poincare disk are quasi-isometric. 
Thus, by Proposition [TJ we immediately have the following result. 

Corollary 15. There exists a constant c s.t.,for all k, ringed tree RT(k) is c-hyperbolic. 



Alternatively, we also provide a direct proof of this corollary (Section |4.3.3 1 to show that the ringed tree 



RT(k) is Rips 5-hyperbolic, and Gromov's 40-hyperbolic in terms of the four point condition. Our direct 
analysis also provides important properties of ringed trees that are used by later analyses. 
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Next, we address the question of whether long-range edges added at each level of the ring maintains 
or destroys the hyperbolicity of the base graph. Given two vertices u and v at some level t of the ringed 
tree, we define the ring distance between u and v, denoted cIr(u,v), to be the length of the shorter path 
connecting u and v purely through the ring edges at the level t. Given any function / from positive integers 
to positive integers, let RT(k, f) denote the class of graphs constructed by adding long-range edges on the 
ringed tree RT{k), such that for each long-range edge (u, v) connecting vertices u and v at the same level, 
du(u, v) < f(n), where n = 2 k — 1 is the number of vertices in the ringed tree RT(k). Since long-range 
edges do not reduce distances from root to any other vertices, the diameter of any graph in RT(k, /) is still 
6(log n). Define S(RT(k, /)) = max Geiff(fci/) 5(G). 

Our second result (used in the proof of the first part of our next result, but explicitly stated here since it 
is also of independent interest) is the following. 

Theorem 3. 5(RT(k, /)) = 0(log/(n)), for any positive function f and positive integer k, where n = 
2 k — 1 is the number of vertices in the ringed tree RT(k). 

This result indicates that if the long-range edges added do not span far-away vertices, then the graph should 
have good hyperbolicity. In particular, if we take f(n) = logn, then the theorem implies that the class 
RT(k, f) is logarithmically hyperbolic. The theorem covers all (deterministic) graphs in the class RT(k, /). 
We can extend it to random graphs, such that if we can show that with high probability the random graph 
is in the class RT(k, /), then we know that the hyperbolic 5 of the random graph is 0(log /(n)) with high 
probability. The first result in the next theorem is proven via this approach. 

Next, we consider adding random edges between two vertices at the outermost level, i.e., level k — 1, 
such that the probability connecting two vertices u and v is determined by a function g(u, v). Let Vk-\ 
denote the set of vertices at level k — 1, i.e., the leaves of the original binary tree. Given a real-valued 
positive function g(u,v), let RRT(k,g) denote a random graph constructed as follows. We start with the 
ringed tree RT(k), and then for each vertex v G Vk-i, we add one long-range edge to a vertex u with 
probability proportional to g(u, v), that is, with probability g(u, v)p~ where p v = ^2 ueVk 1 g(u, v). 

We study three families of functions g, each of which has the characteristic that vertices closer to one 
another (by some measure) are more likely to be connected by a long-range edge. The first two families use 
the ring distance dji(u, v) as the closeness measure. In particular, the first family uses an exponential decay 
function gi(u, v) = e~ adR<yU,v \ The second family uses apower-law decay function g2(u, v) = dji(u, v)~ a , 
where a > 0. The third family uses the height of the lowest common ancestor of u and v, denoted as h(u, v), 
as the closeness measure, and the function is g^ = 2~ ah ( u ' v \ Note that this last probability function matches 
the function used by Kleinberg in a small-world model based on the tree structure 11231 . The following 
theorem summarizes the hyperbolicity behavior of these three families of random ringed trees. 

Theorem 4. Considering the follow families of functions (with u and v as the variables of the function) 
for random ringed trees RRT(k, g), for any positive integer k and positive real number a, with probability 
1 — o(l) (when n tends to infinity), we have 

1. 5(RRT(k,e- adR ^)) = 0(loglogn); 

2. 6(RRT(k,d R {u,v)- a )) = G(logn); 

3. 5(RRT(k,2- ah( - u ^)) = 9(logn); 

where n = 2 k — 1 is the number of vertices in the ringed tree RT(k). 
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This theorem states that, when the random long-range edges are selected using exponential decay function 
based on the ring distance measure, the resulting graph is logarithmically hyperbolic, i.e., the constant 
hyperbolicity of the original base graph is degraded only slightly; but when a power-law decay function 
based on the ring distance measure or an exponential decay function based on common ancestor measure 
is used, then hyperbolicity is destroyed and the resulting graph is not hyperbolic. One may notice that the 
function form in (1) and (3) above is similar but the result is different. This is because with height h(u, v) 
the subtree covers actually Q(2 h ( u < v ^) leaves, and thus (3) is naturally closer to the power-law function of 
(2). Intuitively, when it is more likely for a long-range edge to connect two far-away vertices, such an edge 
creates a shortcut for many internal tree nodes so that many shortest paths will go through this shortcut 
instead of traversing through tree nodes. (In Internet routing this is referred to as valley routes). 

Finally, as a comparison, we also study the hyperbolicity of random binary trees RBT(k, g), which is 
the same as random ringed trees RRT(k, g) except that we remove all ring edges. 

Theorem 5. Considering the follow families of functions (with u and v as the variables of the function) 
for random binary trees RBT(k, g), for any positive integer k and positive real number a, with probability 
1 — o(l) (when n tends to infinity), we have 

5(RBT(k,e- adR{u ' v) )) = 5(RBT(k,d R (u,v)- a )) = 5(RBT(k, 2~ ah ( u ' v ^)) = 8(logn), 

where n = 2 k — 1 is the number of vertices in the binary tree RBT(k, g). 

Thus, in this case, the original hyperbolicity of the base graph (5 = for the binary tree) is destroyed. 
Comparing with Theorem[4j our results above suggest that the "softening" of the hyperbolicity provided by 
the rings is essential in maintaining good hyperbolicity: with rings, random ringed trees with exponential 
decay function (depending on the ringed distance) are logarithmically hyperbolic, but without the rings the 
resulting graphs are not hyperbolic. 



4.2 Outline of the analysis 



In this subsection, we provide a summary of the proof of the four theorems in Section 4.1 For Theorem [2] 



we provide an embedding of the ringed tree to the Poincare disk, intuitively similar to the picture we show 
in Figure [TJd), and prove that it is a quasi-isometry. 

For the analysis of 5-hyperbolicity, we apply the Rips condition, which is equivalent to the Gromov's 
four point condition up to a constant factor. For any two vertices u and v on the ringed tree RT(k), we 
define the canonical geodesic (u, v) to be the geodesic from utoti such that the geodesic always goes up 
first, then follows ring edges, and then goes down (any of these segments may be omitted). We show that the 
canonical geodesic (u, v) and any other geodesic [u, v] are within distance 1 of each other, and any triangle 
A(u,v,w) formed by three canonical geodesies (u, v), (u,w), and (v,w) (called canonical triangle) are 
3-slim. This immediately implies that any geodesic triangles in RT(k) is 5-slim, which is a direct proof that 
ringed trees are constantly hyperbolic. 

For Theorem|3j we inductively prove that any geodesic [u, v] in RT(k, f ) is within 0(log f(n)) distance 
from the canonical geodesic (u, v), and vice versa. Together with the result that any canonical triangle is 3- 
slim, it follows know that any geodesic triangle is 0(log /(n))-slim. For Theorem |4j Part (1), we show that 
with high probability the long-range edges only connect vertices within ring distance 0(log n), and then we 
can apply Theorem[3]to achieve the 0(log log n) bound. For Theorem]?} Part (2), the key is to show that (a) 
with high probability some long-range link connects two vertices at ring distance 0(n c ) for some constant 
c; and (b) if such a long-range edge (u, v) exists, then we consider the geodesic triangle A (it, v, r) where r 
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is a point with lowest layer number on the canonical geodesic between u, v, and show that the middle point 
of [r, u] is 6(logn) away from the union of [r, v] and [v,u]. For Theorem |4j Part (3), we first show that 
with high probability some pair of vertices u, v with h(u, v) > clog 2 n for some constant c > 0, and then 
we observe that, in such configuration, the ring distance d R (u, v) has high probability to be Q,{n c / 2 ), and 
results follows exactly the same analysis in the previous part. 

For Theorem [5} part (2) and (3) follow a similar strategy as those of Theorem [4] For part (1), we know 
that two "would-be" ring neighbors u and v have constant probability of having a long-range connection. 
However, since we do not have ring edges, the alternative path between u and v through the tree may be 
6 (log n) in length. We show that there are at least Q,(^fn) such pairs, so with high probability at least one 
pair is connected, generating a bad 5 of i7(log n). 

4.3 Detailed analysis on ringed trees 
4.3.1 Properties of ringed tree 

We start by some properties of ringed tree, which will be repeatedly used in the following analysis on ringed 
tree related graphs and which may be of independent interest. 

We define the ring distance d R (u, v) of u and v on the same level to be their distance on the ring. Ringed 
trees have the following fundamental property. 

Lemma 16. Let u and v be two vertices on the same level, and v! and v' be their parents respectively. We 
have d R (u', v') < (d R {u, v) + l)/2. 

Proof. On the ring, there are d R (u, v) + 1 vertices on segment between u and v, which belong to at most 
(d R (u, v) + 1)/2 + 1 parents, which correspond to at most (d R (u, v) + 1)/2 + 1 vertices on the ring segment 
between v! and v' . This concludes the proof. □ 

For a geodesic [u, v] on the ringed tree RT(k), we call its level sequence the sequence of levels it passes 
by from uto v. Lemma [T6| implies that the level sequence of any geodesic must be reversed unimodal: it 
first decreases, and then increases (but the increasing or decreasing segment may be omitted). The following 
lemma further characterizes geodesies in ringed-trees. 

Lemma 17. Let u, v be two vertices, u be on level £, and v! be the parent ofu at level £ — 1. Suppose [u, v] 
intersects level £ — 1, and let t be the intersection closest to u. Then d(u, t) < 2, the segment [u, t] of [u, v] 
and {u, u'} are within distance 1 to each other. 

Proof. Let t' be the node just before t to u on [t, u]. d(t, u) < 1 + d R (t, v!) < 1 + (d R (t', u) + l)/2 by 
Lemma [T6| As t is the closest node on level £ — 1 on the geodesic to u, d(t, u) > 1 + d R (t' , u). We get 
d(t ; , u) < 1 and d(t, u) < 2 by combining these two inequalities. The segment [u, t] of [u, v] and {u, u'} 
are within distance 1 to each other since d(t, u') < 1. □ 

For two vertices u and v, we define the canonical geodesic (n, v) in a recursive fashion. 

1. For u, v on the same level and d R (u, v) < 3, (u, v) is the path on the ring from u to v. 

2. For u, v on the same level but d R (u, v) > 3, let u' , v' be parents of u, v respectively, then (u, v) = 

[u,u'] U (u',v') U [v',v]. 

3. For u, v on different levels, supposing u on upper level, let v' be parent of v, then (u, v) = {u, v') U 

[v',v]. 



25 



This is a well-founded definition. At each level of recursion, either the difference of levels of nodes de- 



creases, or in the case of nodes on the same level, ring distance decreases by Lemma 16 until we reach the 
base case, where dji(u, v) < 3. 

We prove now that canonical geodesies are really geodesies. 

Lemma 18. For any u, v, (u, v) is a geodesic between u, v. 

Proof. For u, v on the same level and dn{u, v) < 3, we can check that (u, v) is a geodesic between u, v. 

For u, v on the same level but dji(u, v) > 3, let u' , v' be parents of u, v respectively. Let t, s be the 
closest node to u, v to be in an upper level on [u, v] respectively. If t ^ u', then d(u, t) > 2 because the 
only way to go up one level in one step is to go to the parent. By Lemma [nj [u, t] and {u, u'} are within 
distance 1 to each other. Therefore d(u',t) = 1, as d(u,t) > 2, and {u, u'} U [t,v] is also a geodesic 
from u to v. This is also correct for t = u'. The same can be proved for s. By combining, we have that 
{u, u'} U [t, s] U {v 1 , v} is also a geodesic between u, v for any geodesies [t, s). If we pick the canonical 
geodesic (t, s), in any cases, this will be the canonical geodesic (u, v). Therefore, (u, v) is a geodesic. 

For u, v on different levels, the induction is essentially the same as in the previous case, but we only 
need to reason on t only on the side of u. 

This concludes our induction. □ 

4.3.2 Proof of Theorem |2| Quasi-isometry from infinite ringed tree to the Poincare disk 

In this subsection, we will exhibit and prove a quasi-isometry from the infinite ringed tree RT(oo) to the 
Poincare disk. We denote its distance djtx- We denote (D, dp) the Poincare disk, where D is the open disk 
of radius 1 on the complex plane. 

Here is a brief summary of our approach here. First we propose a candidate of quasi-isometry, then 
all possible cases of images of two points of ringed tree are divided into four categories, each of which 
is separately analyzed. We then proceed with an analysis on the metric of ringed tree and show that the 
candidate of quasi-isometry is effective, thus ringed tree and the Poincare disk are quasi-isometric. 

The following inequalities are used in the following. 

ln(x) < cosh -1 (a;) < ln{2x) for x > 1 

2 . . . vr 
—x < sm x < x for < x < — 
-k ~ y 1 ~ ~ ~ 2 

x 3^ 1 

1 < y/l-x < 1 for < x < - 

2 ~ 5 ~ ~ 2 

To state the promised quasi-isometry, we give coordinates to nodes in RT(oo). We know that we can 

number nodes with binary strings, which can be regarded as a number. For a node on the k-th level and 

numbered by m, its coordinates are (k,m), with < m < 2 k — 1. The root is level 0. 

Definition 7. Let the following mapping be the candidate of quasi-isometry : 

f : RT(oo) ->• D, (k, m) ^ \A - 2- k e 2i7T ^ 

For0<k<£,m< 2 l ~ x , we define D(k, £, m) = d P (f{k, 0), f(£, m)). This is a distance in the Poincare 
disk. Following is its full expression in k and m. 

II, /I _ o-k _ ,/i _ ?-l P 2in ^i ||2 
D(M,m) = co S h- 1 (l + 2^^ J_l e 6 11 ) 



cosh-^l + 2{^2 i (2 k - 1) - ^/2 k {2 e - l)) 2 + 8^(2^ - l)2 fc (2 fe - 1) sin 2 ^)) 
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We also define D'(k,£,m) = d RT (f(k,0),f(£,m)), with < k < I, m < 2* -1 . This is a distance in 
ringed tree. 

We will now try to bound D(k, £, m) with the following lemma. 
Lemma 19. We have the following bounds on D(k, £, m). 

1. For k = 0, 

if t + f <D {0 , t , m) <^ , + M «). 

2. For < k = £, m > 0, 

ln(2)(4 + 2Llog 2 mJ) <D(k,k,m) < ln(2)(4 + 2[log 2 m\ ) + ln(^)for(A; > 1). 

J. For m = 0, 1 < k < £, 

ln(2)(£ - jfe) - ln(50) < D(k, I, 0) < ln(2)(£ - k). 

4. ForO < k < £ and < m < 2 £ ^ k , 

ln(2)(£ - jfe) - ln(100) < D(jfe, £, m) < ln(2)(£ - jfe) + ln(66vr 2 ). 

5. For0<k<£ and 2 £ - k < m < 2 £ ~\ 

ln(2)(/c-£ + 2Llog 2 mJ +4) <D(k,£,m) < ln(2)(fc - £ + 2[log 2 mj + 6) + ln(vr 2 + 1). 

Proo/ 1. Case fc = 

£>(0, £, m) = cosh _1 (l + 2^2* - 1) 
If £ = 0, £>(0, 0, m) = 0. For £ > 1, we have 

D(0, £, m) > cosh" 1 (2^2^) > ^£ + ^ 

and 

£>(0,*,m) < cosh _1 (3V^) < ^^ + ln(6). 



2. Case < k = £,m> 



D(jfe, jfe, m) = cosh-^l + 2 k+3 (2 k - 1) sin 2 (^)) 

Let a = [log 2 m\ , we have 

D(k,k,m) > cosh" 1 (2 2fe+2 sin 2 (2 a - fe 7r)) > cosh- 1 (2 2/c+2 (2 a+1 - fe ) 2 ) > ln(2)(4 + 2a) 

and 

D(k,k,m) < cosh- 1 (^2 2fc+2 sin 2 (2 a+1 - /c vr)) < cosh- 1 (^2 2fc+2 (2 a+1 - fc 7r) 2 ) 
< ln(2)(4 + 2a) + ln(A ) 
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3. Case 0<fc<^,m = 



D(k,£,0) = cosh- 1 (l + 2( v / 2^(2 fc -l)- J2 k (2 e -1)) 



= cosh-^l + 2 k+e+1 (Vl~2~ k - Vl ~ 2- £ ) 2 ) 
As 1 < k < £, Vl - 2~ e - Vl - 2- k > 0, and we have 

Vl - 2-* - Vl ~ 2~ k > 1 - - 1 + -2~ k > -2- k - 1 

5 5 

and 

Vl - 2-z - Vl ~ 2- fe < 1 - -2- £ - 1 + 2~ k ~ 1 < 2- k - 1 

5 

Therefore, we have ln(2)(^ - k) - ln(50) < D(k, £, 0) < ln(2)(£ - k). 

4. Case 0<fc<^,0<m< 2 i ~ 1 

We now deal with the general case with m > and < k < i. 

D(k,e,m) = cosh- 1 (cosh( J D(A:, £, 0)) + 8^2 e (2 e - l)2 k (2 k - 1) sin 2 (vr^)) 

We always note a = [log 2 m\ , and we have 

^2^-l)2 fc (2 fc -l)sin 2 (7r^) > 2 e+k ' 1 2 2a ^ e+2 = 2 k ' e+2a+1 



and 



^2^-l)2^-l)sm 2 (7r™) < 2 ^ fc 2 2a+2 - 2 V < 2 fc ^+ 2a +V 



2 e> 

The previous bound on D(k, £, 0) transforms into the following by applying cosh. 

^<cosh(£>(M,0))<2 € - fe 
Suitable substitution of cosh(D(&;, £, 0)) gives 

cosh -l(^ + 2 fe-^+2a+4 ) < D(fc) ^ m) < cosh - 1 (2^ fc + 2 fe ^+ 2a + 5 7T 2 ). 

For < a < ^ - k, therefore < m < 2 £ ~ k , 

D(k, £, m) > cosh"^^ + 2 fc ^+ 2a + 4 ) > ln(^) = ln(2)(£ - k) - ln(100) 

and 

D(k,£,m) < cosh~ 1 (2^ fc + 2 fe ^ +2a +V) 

< ln(2 £ - fe 33vr 2 ) + ln{2) = \n{2){£ - k + 1) + ln(33^ 2 ). 

For £ - k + I < a < m - I, therefore 2 e ~ k < m < 2 1 ' 1 , 

D(k, £, m) > coshT^ — + 2 k - £+2a+A ) > ln(2 fc - £+2a+4 ) = ln(2)(ife - £ + 2a + 4) 
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and 



D(k,e,m) < cosrr 1 (2^ fc + 2 fc ^ +2a+ V) 

< ln((vr 2 + i) 2 k - e+2a + 5 ) + ln{2) = \n{2){k - I + 2a + 6) + ln(vr 2 + 1). 

□ 

We will now try to relate D(k, £, m) and D'{k, £, m) in the following lemma. 
Lemma 20. For any < k < £,0 < m < 2 l ~ x , 

l -^p-D'(k, £, m) - ln(200) < D(k, £, m) < \n{2)D'(k, £, m) + ln(66vr 2 ) 

Proof. Consider the canonical geodesic. If k = £ and m = 1, it is an edge from (k, 0) to (k, 1). In other 
cases, it goes up first from (£, m) to a certain ancestor, then makes 2 or 3 moves on the ring, and finished by 
going straight down to (k, 0). 

In the first case, D'(k, k, 1) = 1. In the second case, as the ancestor of any node (k, m) is (k — 1, ), 
by the form of canonical geodesies, we should first go up £ — k steps to reach level k. If m < 2 l ~ k+l , it 
reaches (k, 0) by at most an extra step. In this case, £ — k < D'(k, £,m) < I — k + 1. If rn > 2 i ~ k+1 , 
we go up [log2 m\ — 1 steps from (£, m) to reach an ancestor numbered 2 or 3, then go 2 or 3 steps on 
the ring to the node numbered on the same level, and finish by going down to (k, 0). Thus we have 
2|_log 2 m\ + k - £ < D'(k, £, m)2|_log 2 m\ + k - £ + 1 in this case. 

We conclude by comparing to bounds in Lemma [T9| □ 



We want to bound distance between any two points in RT(oo) with the following lemma. 

Lemma 21. For u\ , V\ , u<i , V2 £ RT(oo) withu\,v\,U2,V2 on the same level and dn{u\,V\) = (Ir(u2,V2), 
we have \d(ui, V\) — d{u2,V2)\ < 3. 

Proof. Let A = dR(u\, v\). It is clearly correct when A < 3 by the structure of canonical geodesies. 

For A < 3, when we consider the canonical geodesies, we take successive ancestors of u\ and v\ until 
their distance is 2 or 3. This takes at least [log 2 vlj — 1 generations, but at most Llog 2 (^4 — 1)J. These 
bounds differ by at most 1. Platforms differ by at most 1, therefore \d{u\,v\) — d(ti 2 , f 2 )| < 3. □ 



Proof of Theorem^ By symmetry, Lemma 21 and Lemma 20 for any u, v G Rt(oo), we have: 

l ^d RT (u,v) -ln(200) < d P (f(u),f(v)) < ln(2)d RT (u,v) + ln(66vr 2 ) 

There is only one thing left to prove quasi-isometry. We will now prove that for some constant e, 
B(f(RT (oo)), e) covers the Poincare disk. Images of each level are all on concentric circles, and the differ- 
ence of radius between successive levels can be bounded by m(6). Distance between images of neighboring 
nodes on the same level can be bounded by In (16). For any point in the Poincare disk, its distance to 
the nearest image of nodes is bounded by ln(96), by first moving straight away from until reaching a 
concentric circle of images, then take the shortest path to reach image of a certain node. 

This proves / to be a ( j^y, ln(667r 2 ))-quasi-isometry from RT(oo) to the Poincare disk, thus RT(oo) 
and the Poincare disk are quasi-isometric. Constants we found here are not tight. □ 
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4.3.3 A direct proof of Corollary |I5| 

We begin with a lemma about the distance between a general geodesic and the corresponding canonical 
geodesic. 

Lemma 22. For any geodesic [u, v], [u, v] and (u, v) are within distance 1 to each other. 

Proof. We perform an induction on the structure of (u, v). 

For u, v on the same level and dji(u, v) < 3, we can check that [u, v] and (u, v) are within distance 1 to 
each other. 

For u, v on the same level but dji(u, v) > 3, let u' , v' be parents of it, v respectively. We have (u, v) = 



(u, u') U (u' , v') U (v' , v) by construction. By Lemma 17 the parts of [u, v] that are on the same level with 



u, v verify the condition already. We only need to deal with the part on levels with lower numbering. 

Let t, s be the closest node to u, v that are in an upper level on [u, v] respectively. If t ^ v! , then 
d(u, t) > 2 because the only way to go up one level in one step is to go to the parent. By Lemma \TJ\ [it, t] 
and (u, v!) are within distance 1 to each other. Therefore d(u', t) = 1, as d(u, t) > 2, (u, v!) U (vf, t) U [t, v] 
is also a geodesic from u to v. This is also correct for t = u'. The same can be proven for s. By combining 
the above, we have that (u, v!) U (u', t) U [t, s] U (s, v') U (v', v) is also a geodesic between u, v. Thus the 
section (u', t) U [t, s] U (s, v') is also a geodesic from v! to v' . By induction hypothesis, it is within distance 
1 to (u\ v'). However, this geodesic contains the part of [u, v] on levels with lower numbering than that of 
u, v, which is precisely [t, s]. We conclude that [u, v] and (u, v) are within distance 1 to each other. 

For u, v on different levels, the induction is essentially the same as in the previous case, but we only 
need to reason on t on the side of u. 

This concludes our induction. □ 

Lemma 23. Let u and v be two vertices at the same level £ such that (u, v) stays at level L Then for any 
vertex w, (u, w) and (v, w) are within distance 3 of each other. 

Proof. Let Iq and £' be the highest levels reached by (u,w) and (v,w), respectively. Without loss of 
generality, we assume that £q > £' . Thus £ > £q. We prove the lemma by an induction on I. 

Consider the base case of I = £q. Let w' be the ancestor of w at level £q. Thus we know that both (w', u) 
and (u, v) stay at level £q. By definition dji(w' , u) < 3 and dji(u, v) < 3. We can enumerate all the possible 
cases of u, v, w' arrangement to check that (w', v) is at most one level higher than £q (i.e., £' Q > £q — 1), and 
(w', v) and (w', u) are within distance 3 of each other. Since (u, w) and (v, w) share the portion (w, w'}, 
the lemma holds for the case of £ = £q. 

For the induction step, consider £ > £q. Let v! and v' be the parents of u and v, respectively. By 



Lemma 16 and dn(u,v) < 3, we know that dn(u',v') < 2. Then (u',v') must stay at level £ — 1. By 
induction hypothesis, (u', w) and (v' , w) are within distance 3 of each other. Since (u, w) = (u, u')U(u' , w), 
(v, w) = (v, v') U (v',w), and dn(u, v) < 3, we know that the lemma holds in this case. □ 

We define canonical triangle A(u,v,w) to be a geodesic triangle in which all sides are canonical 
geodesies. 

Lemma 24. Any canonical triangle A(n, v, w) in RT(k) is 3-slim. 

Proof. Without loss of generality, let u be the vertex at the lowest level, which is I. We prove the lemma by 
an induction on I. The base case of £ = is trivial. 

Consider the induction step with £ > 0. If neither v nor w is at level £, then both (u, v) and (u, w) go 
through it's parent v! . By induction hypothesis, we know that A(n', v, w) is 3-slim. Adding u in this case 
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does not change the distance among the sides, and thus A(u, v, w) is also 3-slim. Suppose now that v or w 
or both are at level I. In the first case, suppose that at least one pair, say u and v, is such that (u, v) stays 



at level I. By Lemma 23 we know that (u, w) and (v, w) are within distance 3 of each other. Since (u, v) 
has length at most 3, we know that A(u, v, w) is 3-slim. In the second case, suppose that all pairs at level 
I have their canonical geodesies into level £ — 1. For each vertex at level £, we take its parent, together 
with perhaps another vertex already within level £ — 1, we can apply the induction hypothesis and show that 
their canonical triangle is 3-slim. Since for every vertex at level £, its canonical geodesies to the other two 
vertices all go through its parent, the vertices at level £ do not change the distance between any pair of sides 
of the canonical triangle. Therefore, A(u, v , w) is 3-slim. □ 



With Lemmas 22 and 24 we can provide a direct proof of Corollary [T5| 



Direct proof of Corollary 15 We use Rips condition here. For any u, v, w, By Lemma 22 we have [u, v] C 



B((u, v), 1). By Lemma 24 we have (u,v) C B((u,w) U (v,w),3). By Lemma 22 again, we have 
(u, w) U (v, w) C B([u, w] U [v, w], 1). Therefore, we conclude that [u, v] C B([u, w] U [v, w], 5), and thus 
5 Rips {RT(k)) < 5. Since 5(RT(k)) < 85 Rips (RT(k)), we have 5(RT(k)) < 40. □ 

4.3.4 Proof of Theorem g] 

We now apply Rips condition to analyze the hyperbolicity of ringed trees with limited long-range edges 
RT(k, /). First we show that a geodesic in RT(k, f) cannot make too many hops at the same level of 
the ringed tree. For any two vertices in a graph G G RT(k, /), let [u, v] denote any one of the geodesies 
between u and v in G, while (u, v) still denote the canonical geodesic between u and v in the base ringed 
tree RT(k). 

Lemma 25. For any u,v on the same level in RT(k) with d(u,v) > 1, 2 log 2 dn(u, v) < d(u,v) < 
2log 2 (d R (u,v)-l) + 2. 

Proof. We perform induction on d R (u,v). If 1 < d R (u,v) < 3, we can check that it is correct. If 
dn{u,v) > 3, let it',?/ be parents of u,v respectively. By induction hypothesis, 21og 2 dn(u', v') < 
d(u',v') < 2log 2 (d R (u',v')-l) + 2. By triangle inequality, d{u, v) < d(u',v') + 2 < 2 \og 2 (d R (u', v') - 
1) + 4. Butd R (u',v') - 1 < (d R (u,v) - l)/2 by Lemma[l6j And we have d(u,v) < 2 log 2 (d R (u', v') - 
1) + 4 < 2\og 2 {d R (u,v) - 1) + 2. On the other hand, d(u,v) > 2 + d(u',v') > 2 \og 2 {2d R (v! , v')). As 
d R (u', v') > d R (u, v)/2, we have d(u, v) > 2 log 2 d R (u, v). Combining this two inequalities concludes the 
induction. □ 

Corollary 26. For any u, v on the same level in RT(k), d(u, v ) < 2 log 2 d R (u, v) + 2. 

Proof. We check for the case d(u, v) = 1 and it is satisfied. For d(u, v) > 1, this results directly from 

d(u, v) < 2 \og 2 {d R {u, v) - 1) + 2 < 2 log 2 d R (u, v) + 2. □ 

Lemma 27. For a graph in the class RT(k, /) and two vertices u and v at the same level j, if [u, v] never 

goes into vertices at level i < j, then d(u,v) < max(32, 41og 2 /(n))— landd R (u, v) < f(n) max(32, 41og 2 f(n))- 

1. 

Proof. First, we will prove d R (u, v) / f(n) < d(u, v) < 2 log 2 d R (u, v) + 2. The first inequality is because 
the shortest distance from u to v without going into vertices in level i < j in the base ringed tree RT(k) 
is d R (u, v), and each long-range edge can jump at most f(n) hops on the ring at any level. The second 



inequality results directly from Corollary 26 Let x = d(u, v). From the above two inequalities, we have 
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x < 2\og 2 x f(n) + 2. Then, f(n) > l^-^l 2 j x > 2^ +1 ^ i for x > 32, and thus x = d(u,v) < 
max(32,41og 2 /(n)) - 1. It follows that d R (u, v) < f{n) max(32, 41og 2 /(n)) - 1. □ 

Lemma 28. For a graph in the class RT(k, f(n)) and two vertices u and v, [u, v] and (it, v) are within 
distance 2 max(32, 4 log 2 f(ji)) to each other. 

Proof. The case of u and v are ancestor and descendant to each other on the tree are trivial. Thus we 
consider the case that u and v are not ancestor and descendant to each other. Let £q and l\j be the innermost 

level that geodesies (u, v) and [u, v] reach, respectively. If i\j < £q, then [u, v] uses at least two more tree 
edges than (it, v). This means (u, v) uses at least two ring edges. If (u, v) uses exactly two ring edges, all 
edges in [it, v] must be tree edges, which is impossible to be a geodesic given that u and v are not ancestor 
and descendant to each other. Thus, (u, v) uses exactly three ring edges. Then [it, v] uses exactly one ring 
or long-range edge. Let v! and v' be the ancestors of it and v at level £q , respectively. In this case, the only 
possible situation is : (a) i\j = £q — 1, and (b) the parents of v! and v' are connected either by a ring edge 
or a long-range edge, which is used by [u, v]. Hence, [u, v] and (it, v) share the tree edges from it to v! and 
v' to v, and (u, v) goes through ring edges from it' to v' while [it, v] goes through the edge connecting the 
parents of u' and v'. We thus have that [it, v] and (it, v) are within distance 1 of each other. Therefore, from 
now on, we consider i\j > £q . 

Let £ u and £ v be the levels of u and v respectively. Without loss of generality, We can suppose that 
! > f > 

For any level £ both reachable by [it, v] and (it, v), i.e., £q < £ < £ u , let xi be the first level-^ vertex on 
geodesic [u, v] starting from u, and let yi be the first level-£ vertex on geodesic (u, v) starting from u. We 
claim that d R (x e , y t ) < f(n) max(32, 41og 2 f(n)). 

For level £ u , it is trivial. Suppose that our claim is correct for some level £ > £q, and we inductively 
prove the claim for level £—1. By the induction hypothesis, we know that d R (xe, yi) < f(n) max(32, 4 log 2 /( 



Let x' t be the level-^ vertex just before x^_i on [it, v] starting from it. By Lemma 27 and the fact that the 
portion [xg, x' e ] never goes to level i < I, we have d R (xe, x' e ) < f(n) max(32, 41og 2 /(n)) — 1 . Therefore, 
dR{x' e ,y e ) < d R (x e ,y e ) + d R (x e ,x' e ) < 2f(n) max(32,41og 2 f(n)) - 1. Since and are the 



parents of x' e and yt respectively, by Lemma 16 we have d R (xe-i, ye-i) < f[n) max(32, 41og 2 fin)). Our 
claim holds for level £ — 1. By induction, our claim stands. 

We thus have d(x e ,y e ) < 2log 2 (d R {x e ,y e )) + 2 < max(32, 4 log 2 f(n)), for all i\j < I < £ u . For 

we have 
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any vertex x between xg and x' £ on [it, v] (note that x may be at a level £' > £), by Lemma 
d(x, yi) < d(x, xi) + d(xe, ye) < d(x' e , xi) + d(xe, ye) < 2 max(32, 4 log 2 f(n)). Hence all such vertices 
x are within distance 2 max(32, 4 log 2 f(n)) from vertex ye in (it, v), and vice versa. 

Similarly, we can define ze to be the the first levels vertex on geodesic [v, u] starting from v, and we to 
be the first level-^ vertex on geodesic (v, u) starting from v, for all £$ < £ < £ v . By a symmetric argument, 
we can show that we are within distance 2 max(32, 4 log 2 f(n)) from all vertices in the segment of [u,v] 
from ze-\ to ze for£ > £\j, and d{z \},w \\) < max(32,41og 2 f(n)). 

The only portion left to argue is from x [\ to zm in geodesic [it, v], and from y [] to w [] in geodesic 



(u, v). By Lemma 
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and the definition of trj, we know that d(x n , z \\) < max(32, 41og 2 f(n)). There- 
to *o 

fore, all vertices irPthe segment from xj) to z \\ in geodesic [u,v] are within 2max(32, 41og 2 f(n)) to 

both y [] and wm. Now, for any vertex x in the segment from y j\ to w.[] in geodesic (u,v), we need to 
e i e e Q 

bound the distance from a; to [u,v]. Since d R (y u ,w u ) < d R (y u , x u ) + d R (x u , z u ) + d R (z u , w u ) < 
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3/(n) max(32, 4 log 2 /(n)), we have d(y u ,w u ) < 21og 2 d R (y u , w n ) + 2 < max(32, 41og 2 /(n)). 
Thus, for any vertex x in the segment from y \\ to u) n in geodesic (u,v), x can reach either x m and 
z [] in at most 2 max(32, 41og 2 / (re)) hops. Therefore, x is within distance 2 max(32, 4 log 2 /(n)) from 
[u,v]. □ 



Proof ofTheo rem [?] We use Rips condition here. For any u, v, w, by Lemma 28 we have [u, v] C B((u, v 



2max(32,41og 2 /(n))). By Lemma 24 (u,v) C to) U («, w), 3). Again by Lemma |28j (u,w) U 

C B([u, w]U [v, w], 2 max(32, 4 log 2 f(n))). Combining these together, we have [u, v] C i?([u, w]U 
[u, 4max(32, 41og 2 /(n)) + 3). Therefore, since 5 and 5 R i ps differ within a constant factor, we have 
5(RT(k, /)) < clog/(n) for some constant c. □ 

4.3.5 Proof of Theorem H 

We first analyze the (5-hyperbolicity of RRT(k, e -^ R (u,v)y 

Proof of Theorem^ part 1. e a < p = £„ e y\{ u } e~ dR ^ a < 2^t^e ia . Therefore, p = 6(1). A 
vertex u on the leaves of RT(k) has a long-range edge with ring distance greater than k with probability 
e{e~ ka )/p. Let k = ^ log n, we know that a vertex has a long-range edge of ring distance greater than 
| log n with probability 0(l/n 2 ) = o(l/n). 

Therefore, with probability 1 — o(l), long-range edges never exceed ring distance ^ logn. From Theo- 
rem|3|it follows that 5(RRT(k, e - ad ^ u > v ))) = O(loglogn), for any a > 0. □ 

For the case of 5(RRT(k, cIr(u, v)~ a )), we first look at a lemma about the effect of long-range edges 
with ring distance S7(n c ) on 5-hyperbolicity of ringed trees. 

Lemma 29. If we add an edge between u and v on the outermost ring of a k level ringed tree with dn{u, v) > 
c'n c , for some constants c and c', then the resulted graph G (possibly with other edges on the outermost 
ring) has 5(G) = ri(logn). 



Proof. Let wbea node of lowest layer number on (u,v). By Lemma 25 and the structure of canonical 
geodesic, d(w,u) > (21og 2 c'n c — 3)/2 > clog 2 n + log 2 c' — 3/2. We consider the midpoint x of 
[it, w]. For any point y in [v,u>], by considering the canonical geodesic (x,y), we know that d(x,y) > 
(clog 2 n + log 2 d — 3/2)/2 — 3. Therefore A(u, v, w) is at best ((clog 2 n + log 2 d — 3/2)/2 — 3)-slim, 
and thus 8(G) = 5 Rips (G) = Q{log n). □ 

A probabilistic version comes naturally as the following corollary. 

Corollary 30. For a random graph G formed by linking edges on leaves of a ringed tree, if for some constant 
c with < c < 1, with high probability there exists an edge linking some u and v with d R (u, v) = Q(n°), 
then with high probability 5(G) = G(logn). 



Proof. Diameter of ringed tree gives 0(log n) upper bound. Lemma 29 gives O(log n) lower bound. □ 

We can now estimate the 5-hyperbolicity of RRT(k, d R (u, v)~ a ). 

Proof of Theorem^ part 2. Note that in a ringed tree with n vertices, at least re/2 of them are leaves. For 
a constant c with < c < 1 and a fixed vertex u, the probability that the long-range edge (u, v) has 

d R (u, v) < (re/4) c (we say that it is good) is p = 2p~ l Y}d=t )C d ~ a > where P = e (ELi d ~ a )- 
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For < a < 1, p = 0(n^ 1_Q; ^ c_1 ') = o(l). For a = 1, p = c + o(l). In these two cases, all edges are 
good with probability at most (c + o(l))^ n ' 2 ' = o(l). 

For a > I, p = 0(1). We take g = 1 — p. We have g = 0(n c ( 1_a )). By picking c = min(l, 2(0.-1) )» 

we have q = 0(n -1 / 2 ). All edges are good with probability (1 — q) 11 / 2 = 0(e~^™) = o(l). 

In all three cases, by Corollary|30| with probability 1 - o(l), 5(RRT(k, d R (u, v)~ a )) = G(log n). □ 

Finally, we estimate the 5-hyperbolicity of RRT(k, 2~ ah( - u ^). 



h-i 



Proof of Theorem^ part 3. We note re^ = (re + l)/2 the number of leaves. Fix a leaf u. There are 2 
leaves v such that h(u,v) = h. Therefore p = Y^h=\ nL 2 h ~ 1 ~ ah . For a constant c with < c < 1, let 
p(c) = p- 1 J2h°f 2nL 2 h - 1 - ah be the probability that u never links to any v with /i(u, v) > clog 2 n. We 
have p = 2~ a n 1 I ^ a /(l — 2 1 ~ a ) for a ^ 1, and p = ^ log 2 for a = 1. For a^l, p(c) = n^ 1- "^ 1- . 
For a = 1, p(c) = c. 

For the case a < 1, p(l/2) = O(l). Therefore with probability (1 — p(l/2))" L = o(l) there exists 
some u,v linked together with h(u,v) > ^log 2 n. For a > 1, we take constant cq = min(l, 2 (a-i) )» 

— 1/2 / 

and p(co) = n L . Therefore with probability (1 — p(co)) nL = 0(e v n £) = (1) linked together with 
h(u, v) > Co log 2 ul. In any cases, with 1 — o(l) probability, there exists u, v linked together by long-range 
edge with h(u, v) > clog 2 til for some constant c. We notice that this occurs uniformly through all edges. 

Given u, v with h(u,v) = h, dn(u,v) < 2 h l 2 with probability at most 2 h 2 1 ~ 2h = 2 1 ~ h by simply 
counting pairs within ring distance 2 h l 2 . With h(u, v) > clog 2 ul, c > 0, we know that with probability 
1 — 2n^ c = 1 — o(l) we have dji(u,v) > 2 h l 2 = n c / 2 . Combining with the previous analysis, we 
prove that with 1 — o(l) probability, there exists u, v linked together by long-range edge with (Ir(u, v) > 
n c / 2 for some constant c > 0. By Corollary 30 and by > re/2, with probability 1 — o(l) we have 



5(RRT(k, 2- ah M)) = e(logn). □ 
4.3.6 Proof of Theorem g] 

We can order a binary tree to give it a ring distance. We will suppose that such a distance is defined 



hereinafter. We begin with a counter part of Lemma 29 in binary tree 



Lemma 31. If there is an edge between two leaves u,v of a binary tree of size n with distance to lowest 
common ancestor h(u, v) = c\ log 2 n + c 2 for some constant c\ > 0, c 2 > 0, then the resulted graph G 
(possibly with other edges on the outermost ring) has 5(G) = r2(log n). 

Proof. Consider w the lowest common ancestor of u, v, and x the midpoint of [u, w]. We have d(x, w) = 
d(w,u)/2 = h(u,v)/2. For any y in [w,v], d(x,y) > h(u,v)/2, as the only path in the tree from y to x 
always passes by w, and we need to climb h(u, v)/2 levels if we use links on leafs. Therefore A(u, v, w) is 
at best h(u, v)/2 = \(c\ log 2 re + c 2 ), and we conclude that 5(G) = r2(logre). □ 

Corollary 32. For a random graph G formed by linking edges on leaves of a binary tree, if for some constant 
c with < c < 1, with high probability there exists an edge linking some u and v with h(u, v) = 0(logn), 
then with high probability 5(G) = ©(log re). 



Proof. Diameter of binary tree gives 0(log re) upper bound. Lemma 31 gives f2(log n) lower bound. □ 
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Proof of Theorem^ For RBT(k, e~ adR ( u ' v ^), the height of the whole tree is h = [log 2 n\. There are 
B(y / n) subtrees of height h/2, with the root at the level h/2. For every neighboring such subtrees, the 
rightmost leaf u on the left subtree and the leftmost leaf v on the right subtree verifies cIr(u,v) = 1, 
h(u,v) > h/2. For each leaf, p = 0(1). Therefore, for u, v with dn(u,v) = 1, there is an extra edge 
between u, v with constant probability e~ a p~ 1 > 0. As there are yjn such pairs, with probability 1 — (1 — 
e~ a p~ 1 )v^ = 1 — o(l), there is a pair of leaves u, v linked by an extra edge with (Ir(u, v) = 1, h(u, v) > 
h/2 = e(logra). By Corollary I32) with probability 1 - o(l), 5(RBT(k,e~ adR ^)) = 9(logn) 



For RBT(k,d R (u,v)- a ) and RBT(k,2~ ah ^), using the same analysis in the proof of Theorem W] 
we know that for some constant c > 0, with 1 — o(l) probability, there is an extra edge between u, v with 
cIr(u, v) = Q,{n c ). We have h(u, v) = (log n ) because a subtree of height h spans a ring distance at most 



2 h . By Corollary 32 we have S(RBT(k, d R (u, v)~ a )) = 5(RBT(k, 2~ ah ^ v ))) = 0(log n). □ 



4.4 Extensions of random ringed tree model 

We will now discuss some extensions of the random ringed tree (RRT) model, and show that our results still 
hold for these extensions, thus extending its expressivity. 

We start from some observations in the proof of Theorem [4] In this proof, the upper bound of 8- 
hyperbolicity is given by Theorem [3j and the lower bound is given by Corollary [30] In the statement of 
Theorem [5J by the definition of RT(k, /), only a uniform bound of ring distance d R (u, v) for each long- 



range edge (u, v) is considered. In the statement of Corollary 30 the only quantity concerning a long-range 
edge (u, v) is also the ringed distance between u and v, and to apply this corollary, we only need to show that 
a long-range edge (n, v) with d R {u : v) = G(n c ) for some constant c exists with high probability. Therefore, 
the proof of Theorem [4] relies only on the ring distances of long-range edges. 

To extend the RRT model while keeping similar properties on 5-hyperbolicity, we only need to show 
that Theorem|3]and Corollary [30] are still applicable in these extensions. We will here discuss two extensions 
on choosing long-range edges. 

A constant number of long-range edges for each node. In the original RRT model, each node only have 
one long-range edge connecting to other nodes. We can extend the model to allow each node to have a 
constant number of long-range edges connecting to a constant number of other nodes. In this extension, 
Theorem |4] still holds, since Theorem[3]is not concerned by the number of long-range edges, and Corollary 



30 is still applicable as the required probability only increases with extra long-range edges. 



Independent long-range edges. In the original RRT model, we choose exactly one long-range edge for 
each node. A variant of the model is that each node u can choose edge (u, v) independently from other 
edges (u,v'), with the same probability as in the original model such that on expectation u connect out with 
one long-range edge. In this variant, Theorem [4] still holds. The reason is that in expectation, at least a 
constant fraction of nodes issue only one edge, and the computation for applying Corollary [30] stays similar. 
It is clear that the application of Theorem [3] stays valid. 

In the two variants discussed above, we can see that Theorem [4] still applies, and we have exactly the 
same property on 5-hyperbolicity of these variants. We can also combine these two variants, and it is clear 
that our results are still valid. 
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5 Discussions and open problems 



Perhaps the most obvious extension of our results is to close the gap in the bounds on the hyperbolicity in 
the low-dimensional small-world model when 7 is at the "sweetspot," as well as extending the results for 
large 7 to dimensions d > 2. Also of interest is characterizing in more detail the hyperbolicity properties of 
other random graph models, in particular those that have substantial heavy-tailed properties. Finally, exact 
computation of 5 by its definition takes 0(n 4 ) time, which is not scalable to large graphs, and thus the 
design of more efficient exact or approximation algorithms would be of interest. 

From a broader perspective, however, our results suggest that 5 is a measure of tree-like-ness that can be 
quite sensitive to noise in graphs, and in particular to randomness as it is implemented in common network 
generative models. Moreover, our results for the 5 hyperbolicity of rewired trees versus rewired low-5 
tree-like metrics suggest that, while quite appropriate for continuous negatively-curved manifolds, the usual 
definition of 5 may be somewhat less useful for discrete graphs. Thus, it would be of interest to address 
questions such as: does there exist a measure other than Gromov's 5 that is more appropriate for graph-based 
data or more robust to noise/randomness as it is used in popular network generation models; is it possible to 
incorporate in a meaningful way nontrivial randomness in other low <5-hyperbolicity graph families; and can 
we construct non-trivial random graph families that contain as much randomness as possible while having 
low 5-hyperbolicity comparing to graph diameter? 
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